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Preface 


This study of algebraic systems is designed to be used either as a supplement to 
current texts or as a text for a course in modern abstract algebra at the junior-senior 
level. As such, it is intended to provide a solid foundation for future study of a 
variety of systems rather than to be a study in depth of any one or more. 

The basic ingredients of algebraic systems — sets of elements, relations, operations, 
mappings —are treated in the first two chapters. The pattern established here, 

(i) a simple and concise presentation of each topic, 

(ii) a wide variety of familiar examples, 

(iii) proofs of most theorems included among the solved problems, 

(iv) a carefully selected set of supplementary exercises, 
is followed throughout the book. 

Beginning in Chapter 3 with the Peano postulates for the natural numbers, the 
several number systems of elementary algebra are constructed in turn and their 
salient properties deduced. This not only introduces the reader to a detailed and 
rigorous development of these number systems but also provides him with much 
needed practice for the deduction of properties of the abstract systems which follow. 

The first abstract algebraic system — the Group — is considered in Chapter 9. 
Co«ets of a subgroup, invariant subgroups and their quotient groups are investigated, 
and the chapter ends with the Jordan-Holder Theorem for finite groups. 

Chapters 10-11 are concerned with Rings, Integral Domains and Fields. Poly¬ 
nomials over rings and fields are then considered in Chapter 12 along with a certain 
amount of elementary theory of equations. Throughout these chapters, considerable 
attention is given to finite rings. 

Vector spaces are introduced in Chapter 13. The algebra of linear transformations 
on a vector space of finite dimension leads naturally to the algebra of matrices 
(Chapter 14). Matrices are then used to solve systems of linear equations and, thus, 
provide simpler solutions of a number of problems connected with vector spaces. 
Matrix polynomials are considered in Chapter 15 as an example of a non-commutative 
polynomial ring. The characteristic polynomial of a square matrix over a field is 
then defined. The characteristic roots and associated invariant vectors of real sym¬ 
metric matrices are used to reduce the equations of conics and quadric surfaces to 
standard form. Linear algebras are formally defined in Chapter 16 and other 
examples briefly considered. 

In the final chapter, Boolean algebras are introduced and the important applica¬ 
tion to simple electric circuits indicated. 

The author wishes to take this opportunity to express his appreciation to the 
staff of the Schaum Publishing Company, especially to Jeffrey Albert and Alan 
Hopenwasser, for their unfailing cooperation. 

Frank Ayres, Jr. 

Carlisle, Penna. 

June, 1965 
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Chapter 1 


Sets 

SETS 

Any collection of objects as (a) the points of a given line segment, (b) the lines through 
a given point in ordinary space, (c) the natural numbers less than ten, (d) the five Jones 

boys and their dog, ( e ) the pages of this book.will be called a set or class. The 

individual points, lines, numbers, boys and dog, pages.will be called elements of the 

respective sets. Generally, sets will be denoted by capital letters and arbitrary elements 
of sets will be denoted by small letters. 

Let A be a given set and let p and q denote certain objects. When p is an element of 
A, we shall indicate this fact by writing p E A; when both p and q are elements of A, 
we shall write p,q G A instead of p € A and q G A; when q is not an element of A, we 
shall write q & A. 

Although in much of our study of sets we will not be concerned with the type of 
elements, sets of numbers will naturally appear in many of the examples and problems. 
For convenience, we shall now reserve 

N to denote the set of all natural numbers 
I to denote the set of all integers 
Q to denote the set of all rational numbers 
R to denote the set of all real numbers 

Example 1: (a) 1 G N and 205 € N since 1 and 205 are natural numbers; —5 S N since i 

and —5 are not natural numbers. 

(6) The symbol € indicates membership and may be translated as “in”, “is in”, 
"are in”, "be in” according to context. Thus, “Let r G Q” may be read as 
“Let„r be in Q” and “For any p,q G I” may be read as “For any p and q 

in /”. We shall at times write n ¥* 0 G I instead of n # 0, n G I; also 

p¥=0,qGI instead of p,q € / with p ¥* 0. 

The sets to be introduced here will always be well-defined, that is, it will always be 
possible to determine whether any given object does or does not belong to the particular 
set. The sets of the first paragraph were defined by means of precise statements in words. 

At times, a set will be given in tabular form by exhibiting its elements between a pair of 

braces; for example, 

A = {a} is the set consisting of the single element a. 

B = (a, b} is the set consisting of the two elements a and b. 

C = {1,2,3,4} is the set of natural numbers less than 5. 

K = {2,4,6, ...} is the set of all even natural numbers. 

— {• • • > ~-10, —5,0,5,10,15, ...} is the set of all integers 
having 5 as a factor. 

The sets C, K, and L above may also be defined as follows: 

C = {%: x E N, x < 5} 

K = {x : x e N, x is even) 

L = {x : x € I, a; is divisible by 5} 

Here each set consists of all objects x satisfying the conditions following the colon. 

See Problem 1. 
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EQUAL SETS 

When two sets A and B consist of the same elements, they are called equal and we 
shall write A = B. To indicate that A and B are not equal, we shall write A ¥= B. 

Example 2: (i) When A = {Mary, Helen, John} and £ = {Helen, John, Mary}, then A = B. 

Note that a variation in the order in which the elements of a set are tabulated 

is immaterial. 

(ii) When A — { 2,3,4} and B = { 3,2,3,2,4}, then A = B since each element of 

A is in £ and each element of £ is in A. Note that a set is not changed by 

repeating one or more of its elements. 

(iii) When A = {1,2} and £ = {1,2,3,4}, then A ¥= B since 3 is an element of 
£ but not of A. 

SUBSETS OF A SET 

Let S be a given set. Any set A, each of whose elements is also an element of S, is 
said to be contained in S and called a subset of S. 

Example 3: The sets A = {2}, £ = {1,2,3}, and C = {4,5} are subsets of S = {1,2,3,4,5}. 
Also, D = {1,2,3,4,5} = S is a subset of S. 

The set E — {1,2,6} is not a subset of S since 6 G E but 6 & S. 

Let A be a subset of S. If A ¥• S, we shall call A a proper subset of S and write 
AcS (to be read “A is a proper subset of S” or “A is properly contained in S”). More 
often and in particular when the possibility A = S is not excluded, we shall write A C S 
(to be read “A is a subset of S” or “A is contained in S’’). Of all the subsets of a given 
set S, only S itself is improper, that is, is not a proper subset of S. 

Example 4: For the sets of Example 3 we may write A Q S, B Q S, C Q S, D £ S, E £ S. The 
precise statements, of course, are A C S, B C S, C C S, D = S, E (f. S. 

Note carefully that € connects an element and a set, while C and Q connect two 
sets. Thus, 2 e S and {2} C S are correct statements, while 2 C S and {2} S S 
are incorrect. 

Let A be a proper subset of S. Now S consists of the elements of A together with 
certain elements not in A. The totality of these latter elements, i.e. those not in A, con¬ 
stitute another proper subset of S called the complement of the subset A in S. 

Example 5: For the set S = {1,2,3,4,6} of Example 3, the complement of A = {2} in S is 
F = {1,3,4,5}. Also, £ = {1,2,3} and C = {4,5} are complementary subsets in S. 

Our discussion of complementary subsets of a given set implies that these subsets be 
proper. The reason is simply that, thus far, we have been depending upon intuition 
regarding sets; that is, we have tacitly assumed that every set must have at least one 
element. In order to remove this restriction (also to provide a complement for the improper 
subset S in S), we introduce the empty or null set 0 as the set having no elements. There 
follow readily ^ ^ j s a subset of every set S. 

(ii) 0 is a proper subset of every set S ¥* 0. 

Example 6: The subsets of S — {a, b, c] are 0, {«}, {5}, {c}, {a, 6}, {a, c}, {5, c}, and {u, 6, c}. 
The pairs of complementary subsets are: 

{o, b,c} and 0 {«. 5} and {c} 

{o,c} and {6} {6,c} and {a} 

There is an even number of subsets and, hence, an odd number of proper subsets of 
a set of 3 elements. Is this true for a set of 303 elements? of 303,000 elements? 

UNIVERSAL SETS 

If U ¥* 0 is a given set whose subsets are under consideration, the given set will 
often be referred to as a universal set. 
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Example 7: Consider the equation 

(* + l)(2x-3)(3* + 4)(*2-2)(a;Z + l) = 0 

whose solution set, that is, the set whose elements are the roots of the equation, is 
S = {—1, 3/2, —4/3, y/2, —yf2, i, —i} provided the universal set is the set of 
all complex numbers. However, if the universal set is R, the solution set is 
A — {—1, 3/2, —4/3, V2> —V2 }. What is the solution set if the universal set is Q? 
is /? is N1 

If, on the contrary, we are given two sets A — {1,2,3} and B — (4,5,6,7}, and 
nothing more, we have little knowledge of the universal set U of which they are subsets. 
For example, U might be {1,2,3, .. .,7}, { x: x G N, x ^ 1000}, N, I, .... Nevertheless, 
when dealing with a number of sets A, B,C, .. ., we shall always think of them as subsets 
of some universal set U not necessarily explicitly defined. With respect to this universal set, 
the complements of the subsets A, B, C, ... will be denoted by A', B\ C’, ... respectively. 


INTERSECTION AND UNION OF SETS 

Let A and B be given sets. The set of all elements which belong to both A and B is 
called the intersection of A and B. It will be denoted by A n B (read either as “the inter¬ 
section of A and B” or as “A cap B"). Thus, 

A C\ B = {x: x G A and x G B) 

The set of all elements which belong to A alone or to B alone or to both A and B is 
called the union of A and B. It will be denoted by A U B (read either as “the union of 
A and B” or as “A cup B”). Thus, 

A U B = {x: x G A alone or xGB alone or xGACiB) 

More often, however, we shall write 

Akj B = {x: x G A or x G B} 

The two are equivalent since every element of A n B is an element of A. 

Example 8: Let A = {1,2,3,4} and B = {2,3,5,8,10}; then A uB = {1,2,3,4,6,8,10} and 
A n,B - {2,3}. See also Problems 2-4. 

Two sets A and B will be called disjoint if they have no element in common, that is, if 
A B = (ft. In Example 6, any two of the sets {a}, {6}, {c} are disjoint; also the sets 
{a,6} and {c}, the sets {a,c} and {6}, and the sets {6,c} and {a} are disjoint. 

VENN DIAGRAMS 

The complement, intersection, and union of sets may be pictured by means of Venn 
diagrams. In the diagrams below the universal set U is represented by points (not indi¬ 
cated) in the interior of a rectangle, and any of its non-empty subsets by points in the 
interior of closed curves. (To avoid confusion, we shall agree that no element of U is 
represented by a point on the boundary of any of these curves.) In Fig. l-l(a), the subsets 
A and B of U satisfy A C B; in Fig. 1-1(6), A n B = 0; in Fig. l-l(c), A and B have at 
least one element in common so that A n B 0. 



(*) (c) 

Fig.l-l 
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Suppose now the interior of U, except for the interior of A, in the diagrams above be 
shaded. In each case, the shaded area will represent the complementary set A' of A in 17. 

The union A U B and the intersection A n B of the sets A and B of Fig. l-l(c) are 
represented by the shaded area in Fig. l-2(a) and (b) respectively. In Fig. l-2(a), the 
unshaded area represents (A U B)', the complement of A U B in U; in Fig. l-2(b), the 
unshaded area represents (A n B)'. From these diagrams, as also from the definitions of 
U and n, it is clear that AuB = SuA and A n B = B n A. 

See Problems 5-7. 



(a) ( b) 

Fig. 1-2 


OPERATIONS WITH SETS 

In addition to complementation, union, and intersection, which we shall call operations 
with sets, we define: 

The difference A — B, in that order, of two sets A and B is the set of all elements of A 
which do not belong to B, i.e., 

A -B = {*: x € A, x & B) 

In Fig. 1-3, A - B is represented by the shaded area 
and B - A by the cross-hatched area. There follow 
A-B = AflB' = B'- A' 

A - B = (3 if and only if A Q B 

A - B = B - A if and only if A = B 

A — B = A if and only if A n B = 0 

Example 9: Prove: (o) A — B — A C\ B' — B' — A'\ (6) A — B = 0 if and only if A C B; 

(c) A-B - A if and only if A n B = 0. 

(a) A-B = {x:xGA,x^B} = {*: * e A and x <= B’) = A n B' 

= {x: x<£A', xGB’} = B' - A' 

(b) Suppose A—B = 0. Then, by (a), A n B' = 0, i.e., A and B' are disjoint. 

Now B and B' are disjoint; hence, since B U B' = U, we have A C B. 

Conversely, suppose A C B. Then A n B' = 0 and A — B = 0. 

(c) Suppose A — B = A. Then A n B' = A, i.e., A c B'. Hence, by (ft), 

A n (B')' = A n B = 0 

Conversely, suppose A n B = 0. Then A — B' = 0, A C B', A n B' = A 
and A—B — A. 

In Problems 5-7, Venn diagrams have been used to illustrate a number of properties 
of operations with sets. Conversely, further possible properties may be read out of these 
diagrams. For example, Fig. 1-3 suggests 

(A — B) U (B — A) = (AUB)-(AnB) 

It must be understood, however, that while any theorem or property can be illustrated by 
a Venn diagram, no theorem can be proved by the use of one. 



Fig. 1-3 
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Example 10: Prove (A— B)u(B — A) = (AuB) — (AnB). 

The proof consists in showing that every element of (A — B) U (B — A) is an 
element of (AuB)-(AnB) and, conversely, every element of (AuB) — (AnB) 
is an element of (A — B) U (B — A). Each step follows from a previous definition 
and it will be left for the reader to substantiate these steps. 

Let x G (A - B) u (B - A); then x & A-B of x G B- A. If a; e A - B, 
then i£A but x € B; if xGB — A, then xGB but * £ A. In either case, 
xGAuB but x«AnB. Hence, * £ (AuB)-(AnB) and 
(A-B)u(B-A) c (AuB)-(AnB) 

Conversely, let * G (AuB) - (AnB); then i£AuB but * G A n B. Now 
either x G A but *«B, i.e., x G A-B, or xGB but x G A, i.e., x G B - A. 
Hence, x G (A - B) u (B - A) and (AuB) - (AnB) c (A - B) u (B - A). 

Finally, (A-B)u(B-A) c (AuB) - (AnB) and 
(AuB)-(AnB) C (A-B)U(B-A) imply (A-B)u(B-A) = (AuB)-(AnB) 

For future reference we list below the more important laws governing operations with 
sets. Here the sets A,B,C are subsets of U the universal set. 


LAWS OF OPERATIONS WITH SETS 

(1.1) (A')' = A 


0.2) 

0.3) 

0' = U 

A- A = 

0, A - 0 = A, 

(1.2') 

A - B = AnB' 

U' = 0 

0.4) 

A u 0 = 

A 

0.4') 

A nU = A 

05) 

A u U = 

U 

05') 

A n 0 =0 

0.6) 

A U A = 

A 

0.6') 

A n A = A 

0.7) 

c 

a* 

ii 

U 

0.7') 

A n A' = 0 


___ 

0.8) 

(AuB) u C = A u (BuC) 

Associative Laws 

OA') 

(AnB) n C = A n (BnC) 

0.9) 

AuB = B u A 

Commutative Laws 

0.9') 

AnB = B n A 


0.10) 

Au(BnC) = (AuB)n(AuC) 

Distributive Laws 

0.10) 

A n (BuC) = (AnB) u 

(AnC) 

0.11) 

0.12) 

(AuB)' = A'nB' 

A - (BuC) = (A-B)n(A-C) 

De Morgan’s Laws 

0.11') 

0.12') 

(AnB)' = A'uB' 

A - (BnC) = (A-B) i 

J(A-C) 


See Problems 8-16. 


THE PRODUCT SET 

Let A — {a, 6} and B — {b,c,d}. The set of distinct ordered pairs 
C = {(a,b), (a,c), (a,d), (b,b),(b,c), (b,d)} 

in which the first component of each pair is an element of A while the second is an element 
of B, is called the product set C = Ax B (in that order) of the given sets. Thus, if A 
and B are arbitrary sets, we define 

A X B = {(x,y) : x G A, y G B} 
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Example 11: Identify the elements of X — {1,2,3} 
as the coordinates of points on the x-axis 
(see Fig. 1-4), thought of as a number 
scale, and the elements of Y = {1,2,3,4} 
as the coordinates of points on the j/-axis, 
thought of as a number scale. Then the 
elements of X x Y are the rectangular 
coordinates of the 12 points shown. 
Similarly, when X = Y = N, the set 
X X Y are the coordinates of all points 
in the first quadrant having integral 
coordinates. 


Fig. 1-4 


MAPPINGS 

Consider the set H = {hi,hi,h 3 , .. ,,h H } of all houses on a certain block of Main Street 
and the set C = {Ci, c 2 , c 3 , .. .,c 3 »] of all children living in this block. We shall be con¬ 
cerned here with the natural association of each child of C with the house of H in which 
the child lives. Let us assume that this results in associating with h 2 , c 2 with h 5 , c 3 with 
hi, c\ with hi, c s with hg, .... c 33 with h 3 . Such an association of or correspondence between 
the elements of C and H is called a mapping of C into H. The unique element of H 
associated with any element of C is called the image of that element (of C) in the mapping. 

Now there are two possibilities for this mapping: (1) every element of H is an image, 
that is, in each house there lives at least one child; (2) at least one element of H is not an 
image, that is, in at least one house there live no children. In the case (1), we shall call 
the correspondence a mapping of C onto H. Thus, the use of ‘onto’ instead of ‘into’ calls 
attention to the fact that in the mapping every element of H is an image. In the case (2), 
we shall call the correspondence a mapping of C into, but not onto, H. Whenever we write 
“a is a mapping of A into B” the possibility that a may, in fact, be a mapping of A onto B 
is not excluded. Only when it is necessary to distinguish between cases will we write either 
“a is a mapping of A onto B” or “a is a mapping of A into, but not onto, B’’. 

A particular mapping a of one set into another may be defined in various ways. For 
example, the mapping of C into H above may be defined by listing the ordered pairs 

a = {(ci, hi), {ci, hi), (c 3 , hi), (c 4 , hi), (c 5 , hg), ..., (c 3 «, hg)} 

It is now clear that « is simply a certain subset of the product set C x H of C and H. Hence, 
we define 

A mapping of a set A into a set B is a subset of A x B in which each element of A 
occurs once and only once as the first component in the elements of the subset. 

In any mapping a of A into B, the set A is called the domain and the set B is called the 
co-domain of a. If the mapping is “onto”, B is also called the range of «; otherwise, the 
range of « is the proper subset of B consisting of the images of all elements of A. 

A mapping of a set A into a set B may also be displayed by the use of -*• to connect 
associated elements. 

Example 12: Let A = {o, b, c} and B — {1,2}. Then 

a: a-»l, 6-»2, c->2 

is a mapping of A onto B (every element of B is an image) while 
p : 1 -» a, 2 -» 6 

is a mapping of B into, but not onto, A (not every element of A is an image). 
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In the mapping a, A is the domain and B is both the co-domain and the range. 
In the mapping p, B is the domain, A is the co-domain, and C = {a, 6} C A is the 
range. 

When the number of elements involved is small, Venn diagrams may be used 
to advantage. Fig. 1-5 below displays the mappings a and p of this example. 



<* P 

Fig. 1-5 


A third way of denoting a mapping is discussed in 

Example 13: Consider the mapping a of N into itself, that is, of N into N, 
a: 1 -» 3, 2 -» 5, 3 - 7, 4 -» 9, ... 

or, more compactly, a : « -» 2n + 1, n G N 

Such a mapping will frequently be defined by 

a : la = 3, 2a = 5, 3a = 7, 4a — 9, ... 

or, more compactly, by a : «a = 2n + 1, n G N 

Here N is the domain (also the co-domain) but not the range of the mapping. The 
range is the proper subset M of N given by 

M ■ - { x : x = 2n + 1, n G N} 
or M = {*:*€ N, * is odd} 

Mappings of a set X into a set Y, especially when X and Y are sets of numbers, are 
better known to the reader as functions. For instance, defining X = N and Y = M in 
Example 13 and using / instead of a, the mapping (function) may be expressed in func¬ 
tional notation as 

(i) y = f(x) = 2x + 1 

We say here that y is defined as a function of x. It is customary nowadays to distinguish 
between “function” and “function of”. Thus, in the example, we would define the func¬ 
tion f by 

/ = {(«, y) ■ y = 2x + i, x e X} 
or f = {(x, 2x + l): x G X] 

that is, as the particular subset of IxF, and consider (i) as the “rule” by which this 
subset is determined. Throughout much of this book we shall use the term mapping rather 
than function and, thus, find little use for the functional notation. 

Let a be a mapping of A into B and j8 be a mapping of B into C. Now the effect of a 
is to map a G A into aa G B and the effect of B is to map act G B into (a«)/3 G C. The 
net result of applying a followed by p is a mapping of A into C which we define by 

a/3 : a(a/3) = (««)/?, a G A 

We shall call «/3 the product of the mappings « and /? in that order. Unfortunately, the 
notation has not been standardized so that a/3 is sometimes used to denote the effect of the 
mapping (3 followed by the mapping «. Note also that we have used the term product 
twice in this chapter with meanings quite different from the familiar product, say, of two 
integers. This is unavoidable unless we keep inventing new names. 
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Example 14: Let A = {a, b,c}, B = {d, e}, C = { f,g,h,i } and 

a : Ha = d, ba = e, ca = e 

P : dp = /, ep = h 

Then a p : a(ap) = (a a )p = dp = f, b( a p) = ep = h, c( a p) = h 



Fig. 1-6 

ONE-TO-ONE MAPPINGS 

A mapping a -*a' of a set A into a set B is called a one-to-one mapping of A into B 
if the images of distinct elements of A are distinct elements of B; if, in addition, every 
element of B is an image, the mapping is called a one-to-one mapping of A onto B. In the 
latter case, it is clear that the mapping a-* a' induces a mapping a' -* a of B onto A. 
The two mappings are usually combined into a** a' and called a one-to-one correspondence 
between A and B. 

Example 15: (o) The mapping a of Example 14 is not a one-to-one mapping of A into B (the dis¬ 

tinct elements b and c of A have the same image). 

(6) The mapping p of Example 14 is a one-to-one mapping of B into, but not onto, 
C (ff S C is not an image). 

(c) When A = {a,b,c,d} and B = {p,q,r,s}, 

(i) aj : a,*-* p, b *r± q, c ** r, d 8 

and (ii) « 2 : a**r, b*-*p, c*+q, d<r±s 

are examples of one-to-one mappings of A onto B. 

Two sets A and B are said to have the same number of elements if and only if a 
one-to-one mapping of A onto B exists. A set A is said to have n elements if there exists 
a one-to-one mapping of A onto the subset S = {1,2,3,...,«} of N. In this case, A is 
called a finite set. 

The mapping a : na = 2n, nGN 

of N onto the proper subset M = {x: x G N, x is even} of N is both one-to-one and onto. 
Now N is an infinite set\ in fact, we may define an infinite set as one for which there exists 
a one-to-one correspondence between it and one of its proper subsets. 

An infinite set is called countable or denumerable if there exists a one-to-one corre¬ 
spondence between it and the set N of all natural numbers. 


ONE-TO-ONE MAPPING OF A SET ONTO ITSELF 

Let a: x**x + 1, p: x <-» Zx, y: x ** 2x - 5, 8 : x <-> x - 1 

be one-to-one mappings of R onto itself. Since for any x G R 
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Xap - (X + 1)P = 3(x + 1) 


while 

xfia -- 

= (3x)a = 3x + l 

we see that 

(i) 

aft ¥= fiat 

However, 

XyS = 

(2x — 5)8 - 2# — 6 

and 

Xa(yS) - (X + l)y8 = 2(x + 1) - 6 = 2x 

while 

Xay = 

(x + l)y = 2x - 3 

and 

x( ay )S = 

(2x — 3) — 1 = 2x-4 

Thus, 

(ii) 

a(yS) = (ay) 8 

Now 

XaS 

= (x +1)8 = X 

and 

X8a 

= (X-I)a = X 


that is, a followed by 8 (also, 8 followed by a) maps each x G R into itself. Denote by 3, 
the identity mapping, 

3 : x +* x 

Then (iii) «S = 8« = 3 

that is, 8 undoes whatever a does (also, a undoes whatever 8 does). In view of (iii), 8 is 
called the inverse mapping of a and we write 8 = a -1 ; also, a is the inverse of 8 and we 
write a = 8 -1 . . „ ,, 

See Problem 18. 

In Problem 19, we prove 

Theorem I. If a is a one-to-one mapping of a set S onto a set T, then a has a unique 
inverse, and conversely. 

In Problem 20, we prove 

Theorem II. If « is a one-to-one mapping of a set S onto a set T and (3 is a one-to-one 
mapping of T onto a set U, then («/J) _1 = fi~ l • «->. 


Solved Problems 

1. Exhibit in tabular form: (a) A = {a: a e N, 2 < a < 6}, (b) B = {p: p e N, p < 10, 
p is odd}, (c) C = {x: x G I, 2x 2 + x — 6 = 0}. 

(а) Here A consists of all natural numbers (a G N) between 2 and 6; thus, A = {3,4, 5}. 

(б) B consists of the odd natural numbers less than 10; thus, B = {1,3,5, 7,9}. 

(c) The elements of C are the integral roots of 2z 2 + x - 6 = (2x - 3)(x + 2) = 0; thus C = {-2}. 
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2. Let A = {a,b,c,d }, B - { a,c,g }, C — { c,g,m,n,p }. Find: 

A U B = {a, b, c, d,g}, A U C = {a,b,c,d,g,m,n,p}, BuC = {a, c,g,m, n,p}; 

A C\ B = {a, c}, AnC = {c}, B C C = {c,g}; An (BuC) = {a,c}; 

(AnB) U C = {a,c,g,m,n,p}, (AuB)nC= {c,g}, (AflB)U(An C) = An (BuC). 

3. Consider the subsets K = {2,4,6,8}, L = {1,2,3,4}, M = {3,4,5,6,8} of U = 
{1,2,3, ..., 10}. (a) Exhibit K',L',M' in tabular form. (6) Show that (K U L)' = 
K'nL'. 

(a) K’ = {1,3,5,7,9,10}, V = {5,6,7,8,9,10}, M' = {1,2,7,9,10}. 

(5) K U L = {1,2,3,4,6,8} so that (K U L)' = {6,7,9,10}. Then K’ n V = {5, 7, 9,10} = (X U L)’. 

4. For the sets of Problem 2, show: (a) (AUB)UC = AU(BU C), (6) (A n B) n C = 

An(Bn C). 

(a) Since A U B = {a, b, e,d,g) and C = {c,g,m,n,p}, we have (AuB)uC = {a, b, c, d,g,m,n, p}. 
Since A = {a,b,e,d} and B U C = {a,e,g,m,n,p }, we have A U (B U C) = {a,b,c,d,g,m,n,p} = 
(AuBju C. 

(6) Since A n B = {a, c}, we have (A n B) n C = {c}. Since BnC = {c,g}, we have 
An(BnC) = {c} = (AnB)nC. 

5. In Fig. l-l(c), let C = A n B, D = AnB', E = B n A' and F = (A U £)'. Verify: 

(a) (A U B)’ = A'n B’, ( b) (A n By = A' U 2?'. 

(a) A'nB' = (S u F) n {£> u F) = F = (Au B)' 

(b) A' u B' = (BuF) u (BuF) = (Bu F) u D = C' = (AnB)' 

6. Use the Venn diagram of Fig. 1-7 to verify: 

(а) E - (AnB) nC' (c) AuBnCis ambiguous 

(б) AUBUC = (AUB)UC = AU(BUC) (d) A' n C = G U L 

(а) A n B = D U B and C’ = B U F U G U L; then 

(AnB)nC' = B 

(б) AuBuC = EuFuGuDuHuJuK. Now 

AuB = BufuGuBuBu/ 
and C — DuHuJuK 

so that 

(AuB)uC = EuFuGuDuHuJuK 
=AuBuC 

Also, BuC = EuGuDuHuJuK and A=EuFuDuH 

sothat Au(BuC) =EufuGuDuflu)uK = AuBuC 

(c) A U B n C could be interpreted either as (A U B) n C or as A U (B n C). Now (A U B) n C = 
D U H U J, while Au(BnC)=Au(Bul) - Aul Thus, AuBnC is ambiguous. 

(d) A' = GuJuKuL and C' — E u F U G U L; hence, A'flC 1 = Gut. 

7. Let A and B be subsets of U. Use Venn diagrams to illustrate: A n B' = A if and 
only if A n B = 0. 

Suppose A n B = 0 and refer to Fig. 1-1(6). Now A C B'; hence AnB' = A. 

Suppose AnB 5 s 0 and refer to Fig. l-l(c). Now A B’\ hence A n B’ ¥= A. 

Thus, A n B' = A if and only if A n B = 0. 

8. Prove: (AuB)UC = AU(BUC). 

Let * G (A u B) u C. Then x&AuB or x G C, so that xGA or xGB or xGC. When 

x e A, then * € A U (B U C); when xGB or x G C, then x G B U C and hence *£AU(BuC). 

Thus (A uB) u C c Au(BuC). 
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Let *e4u(BuC). Then x G A or x G B U C, so that x G A or or x G C. When 

i£4 or x & B, then x G A U B and hence x € (A U B) U C; when 16 C, then i 6 (A U B) U C. 
Thus A u (B U C) c (A u B) u C. 

Now (AuB) u C £ A u (BuC) and A u (BuC) c (AuB) u C imply (AuB) uC = Au (BuC) 
as required. Thus, AuBuC is unambiguous. 

9. Prove: (AflB)nC = An(BflC). 

Let x G (A n B) n C. Then x G A n B and x G C, so that x G A and x G B and x G C. Since 

x G B and x G C, then xGBnC; since x G A and xGBflC, then x G A n (B n C). Thus 

(AnB)nC c An(BnC). 

Let xGAn(BnC). Then x G A and xGBnC, so that x G A and x G B and x G C. Since 

xGA and x G B, then xGAnB; since xGAnB and xGC, then xG(AnB)nC. Thus 

An(BnC) £ (A nB) n C and (A n B) n C = A n (B n C) as required. Thus, A n B n C is unam¬ 
biguous. 


10. Prove: An(BllC) = (AflB)U(An C). 

Let xeAn(BuC). Then xGA and x G B U C (xGB or xGC), so that x G A and xGB 
or x G A and x G C. When x G A and x G B, then x G A n B and so x G (A n B) U (A n C); 
similarly, when x G A and x G C, then xGAnC and so x G (An B) U (AnC). Thus A n (BuC) £ 
(AnB) U (AnC). 

Let x G (A n B) U (A n C), so that xGAnB or xGAnC. When xGAnB, then x G A 
and x G B so that x G A and xGBuC; similarly, when xGAnC, then x G A and x G C so that 
x G A and xGBuC. Thus xGAn(BuC) and (AnB)u(AnC) £ An(BuC). Finally 
A n (B U C) = (A n B) u (A n C) as required. 


11. Prove: (A U B)’ = A' n B'. 

Let x G (A U B)'. Now x G A U B, so that x G A and x G B. Then x G A' and x G B', that is, 
x G A' n B'; hence (A U B)' £ A' n B'. 

Let x G A' n B'. Now x G A' and xGB', so that x G A and x G B. Then x G A U B, so that 
x G (A U B)'; hence A'n B' £ (AuB)'. Thus (AuB)' = A'nB' as required. 


12. Prove: (AnB) U C = (AuC) n (BUC). 

Cu(AnB) = (CuA)n(CuB) by (1.10), Page 5. 
Then (A n B) U C = (A U C) n (B U C) by (1.9), Page 5. 


13. Prove: A-(BUC) = (A — B) n (A — C). 

Let xGA-(BuC). Now x G A and xGBuC, that is, x G A but x G B and x G C. Then 

x G A - B and x G A - C, so that xG(A-B)n(A-Q and A - (B U C) £ (A - B) n (A - C). 

Let x G (A — B) n (A — C). Now x G A — B and x G A — C, that is, x G A but x G B and x G C. 

Then x G A but x G B U C, so that x G A - (B U C) and (A - B) n (A - C) £ A - (B u C). Thus 

A — (B u C) = (A — B) n (A — C) as required. 


14. Prove: (AuB)nB' = A if and only if A n B = 0. 

Using (1.10') and (1.7'), Page 5, we find 

(AuB)nB' = (AnB') u (BnB') = AnB' 
We are then to prove: A n B’ = A if and only if A n B = 0. 

(а) Suppose A n B = 0. Then A £ B' and A n B' = A. 

(б) Suppose A n B' = A. Then A £ B' and A n B = 0. 

Thus, (AuB)nB' = A if (by (o)) and only if (by ( b)) A n B = 0. 
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15. Prove: X C Y if and only if Y' c X’. 

(i) Suppose XcY. Let y' e Y'. Then y' G X since y' & Y; hence, y' G X' and Y’cX'. 

(ii) Conversely, suppose Y' C X'. Now, by (i), (X')' c (F')'; hence, XcY as required. 


16. Prove the identity (A — B)U(B- A) = (A u B) - (A n B) of Example 10 using the 
identity A - B = A n B' of Example 9. 


We have 

(A-B)u(B-A) 


(A n B') u (Bn A') 

[(A n B') u B] n [(A n B') u A'] 

[(A u B) n (B' u B)] n [(A u A') n (B' u A')] 
[(AuB) n 17] n [17 n (B'uA')] 

(AuB)n (B'uA') 

(AuB) n (A'uB') 

(A u B) n (A n B)' 

(A u B) - (A n B) 


by (1.10), Page 5 
by (1.10) 
by (1.7) 
by (1.4') 
by (1.9) 
by (1.11') 


17. Show that any two line segments have the same number of points. 

Let the line segments be AB and A'B' of Fig. 1-8. We are 
to show that it is always possible to establish a one-to-one 
correspondence between the points of the two segments. De¬ 
note the intersection of AB' and BA' by P. On AB take any 
point C and denote the intersection of CP and A'B' by C'. 

The mapping 

C -* C' 

is the required correspondence, since each point of AB has a 
unique image on A'B' and each point of A'B' is the image of 
a unique point on AB. 

18. Prove: (a) x-*x + 2 is a mapping of N into, but not onto, N. (b) x-*Sx-2 is a 
one-to-one mapping of Q onto Q, (c) x -> x 3 - 3* 2 - x is a mapping of R onto R but is 
not one-to-one. 

(а) Clearly x + 2 G N when x G N. The mapping is not onto since 2 is not an image. 

( б ) Clearly 3x — 2 € Q when x G Q. Also, each r G Q is the image of x — (r + 2)/3 G Q. 

( e) Clearly x 3 — 3a ; 2 — xGR when x G R. Also, when rGR, x 3 — 3x 2 — x~r always has a real 
root x whose image is r. When r = —3, x s — 3x 2 — x = r has 3 real roots x = —1,1,3. Since 
each has r = —3 as image, the mapping is not one-to-one. 

19. Prove: If a is a one-to-one mapping of a set S onto a set T, then a has a unique inverse 
and conversely. 

Suppose a is a one-to-one mapping of S onto T; then for any s G S, we have 
s -» sa = t GT 

Since t is unique, it follows that a induces a one-to-one mapping 
P : tp ->• s 

Now s(afi) = (sa)p = tp — 8 ; hence, aP = Jj and p is an inverse of a. Suppose this inverse is not 
unique; in particular, suppose p and y are inverses of a. Since 

ap = pa. - S and «Y = Y“ = 3 



it follows that 
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Pay = p(tey) = p • $ = p 
and Pay = (Pa)y - Jj‘y - y 

Thus p = y; the inverse of a is unique. 

Conversely, let the mapping a of S into T have a unique inverse a -1 . Suppose for s t , s 2 GS, 
with Sj # s 2 , we have Sj« = s 2 a- Then (sja)® -1 = (s 2 a)a~ 1 , so that s t (a • a~') = s 2 (a-a~ l ) and 
Sj = s 2 , a contradiction. Thus a is a one-to-one mapping. Now, for any t G T, we have (ta _1 )a = 
t(a~ l • a) = t • = t; hence, t is the image of s = fa -1 G S and the mapping is onto. 

20. Prove: If a is a one-to-one mapping of a set S onto a set T and /3 is a one-to-one 
mapping of T onto a set U, then (a/?) -1 = /?”* • a -1 . 

Since (aP)(p~ l • «-*) = a(0 • p -1 )* -1 = a* a -1 = t <J, is an inverse of <*/?. By 

Problem 19 such an inverse is unique; hence, (a/3) _1 = /3 _1 • a -1 . 


Supplementary Problems 

21. Exhibit each of the following in tabular form: 

(а) the set of negative integers greater than —6, 

(б) the set of integers between —3 and 4, 

(c) the set of integers whose squares are less than 20, 

( d) the set of all positive factors of 18, 

(e) the set of all common positive factors of 16 and 24, 

(/) {p : p G N, p 2 < 10} 

(p) {b : 6 G N, 3 — 6 — 8} 

(h) {*: x G /, 3« 2 + 7* + 2 = 0} 

(i) {x: x G Q, 2*2 + 5* + 3 = 0} 

Partial answer: (a) (-5,-4,-3,-2,-1}, ( d) {1,2,3,6,9,18}, (/) {1,2,3}, (h) {-2} 

22. Verify: (a) {x : * G N, x < 1} = 0, (6) {* : * G /, 6*2 + 5 * - 4 = 0} = 0. 

23. Exhibit the 15 proper subsets of S = { a,b,e,d }. 

24. Show that the number of proper subsets of S = {(tj, a 2 , ..., o n } is 2" — 1. 

25. Using the sets of Problem 2, verify: 

(a) (A u B) u C = A u (B u O, (6) (A n B) n C = A n (B n C), (c) (A u B) n C * A u (B n C). 

26. Using the sets of Problem 3, verify: (0) (K')’ = K, (b) ( K n L)' — K' U L', (c) (Kulu M)' = 

K'nL'n M', (d) K n (L u M) = (K n L) u (K n M). 

27. Let “n \ m” mean “n is a factor of m”. Given A = {* : * G N, 3 | *} and B = {* : * e N, 5 | *}, 
list 4 elements of each of the sets A', B', A U B, A n B, A U B', A n B', A' U B' where A' and B' 
are the respective complements of A and B in N. 

28. Prove those laws of (1.8)-(1.12'), Page 5, which were not treated in Problems 8-13. 
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29. Let A and B be subsets of a universal set 17. Prove: 

(а) A U B = A n B if and only if A = B, 

(б) A n B = A if and only if A c B, 

(c) (A n B') U (A' nB) = A U B if and only if A n B = 0. 

30. Given n(V) = 692, n(A) = 300, n(B) = 230, n(C) = 370, n(A nB) = 150, n(A n C) = 180, 

n(B nC) = 90, n(A nB'n C') = 10 where n(S) is the number of distinct elements in the set S, find: 

(а) K(AnBnC) = 40 (c) «(A'nB'nC') = 172 

(б) n(A' nBnC') = 30 ( d) w((A n B) U (A n C) U (B n C)) = 340 

31. Given the mappings a : n -* n 2 + 1 and p : n -» 3m + 2 of N into N, find: aa — n* + 2 n 2 + 2, pp, ap = 
3 n 2 + 5, and Pa. 

32. Which of the following mappings of I into I: 

(a)x^x + 2, (b) x -»3», (c) x -»»*, (d)*-»4-*, (e)x-*x«, (/)*-*»»-* 

are (i) mappings of / onto /, (ii) one-to-one mappings of / onto /? Aws. (i), (ii); (o), (d) 

33. Same as Problem 32 with I replaced by Q. Ans. (i), (ii); (a), (6), (d) 

34. Same as Problem 32 with I replaced by R. Ans. (i), (ii); (a), (6), (d), (e) 

35. (a) If E is the set of all even positive integers, show that * -» * +1, x & E is not a mapping of E 

onto the set F of all odd positive integers. 

(6) If E* is the set consisting of zero and all even positive integers (i.e., the non-negative integers), 
show that x -*• * + 1, * € E* is a mapping of E* onto F. 

36. Given the one-to-one mappings 

3: 13 - 1 , 23 = 2 , 33=3, 4J= 4 

a: la = 2, 2a = 3, 3a = 4, 4a = 1 

P : ip = 4, 2 p =1, 3p = 2, 4/5=3 

y : ly =3, 2y = 4, 3y = 1, 4y = 2 

S : 18 = 1, 28 = 4, 3S = 3, 45 = 2 

of S = {1,2, 3,4} onto itself, verify: 

(a) a p = Pa = 3, hence p = a -1 ; (b) ay = ya = P; ( e) aS * Sa; ( d) a 2 = aa = y; (e) y 2 = 3, 

hence y -1 = y; (/) a 4 = 3, hence a 3 = a -1 ; (g) (a 2 ) -1 = (a -1 ) 2 . 




Chapter 2 


Relations and Operations 

RELATIONS 

Consider the set P = {a,b,c,.. ,,t) of all persons living on a certain block of Main 
Street. We shall be concerned in this section with statements such as “a is the brother 
of p”, “c is the father of g”, ..., called relations on (or in) the set P. Similarly, “is 
parallel to”, “is perpendicular to”, “makes an angle of 45° with”, . .., are relations on 
the set L of all lines in a plane. 

Suppose in the set P above that the only fathers are c, d, g and that 
c is the father of a,g,m,p,q 
d is the father of f 
g is the father of h,n 

Then, with % meaning “is the father of”, we may write 

c%a, c%g, c%m, c%p, e%q, d%f, g%h, g%n 

Now c%a may be thought of as determining an ordered pair, either (a,c) or ( c ,o), of the 
product set PxP. Although both will be found in use, we shall always associate 
c%a with the ordered pair (a,c) 

With this understanding, % determines on P the set of ordered pairs 
(o,c), (g, c), (to, c), ( p,c ), (qyc), (f,d), ( h,g ), (n, g) 

As in the case of the term function in Chapter 1, we define this subset of P x P to be the 
relation %. Thus, 

A relation % on a set S (more precisely, a binary relation on S since it will be a 
relation between pairs of elements of S) is a subset of S x S. 

Example 1: (a) Let S = (2,3,5,6} and let % mean “divides”. 

Since 2<*2, 2^6, 3^3, 3^6, 5^5, 6^6, we have 

% = {(2,2), (6,2), (3,3), (6,3), (5,5), (6,6)> 

(6) Let S = {1,2,3, ..., 20} and ‘K mean “is three times”. 

Then 3<ft 1, 6<^ 2, 9 % 3, 12 % 4, 15^ 5, 18 % 6 and 

% = {(1,3), (2,6), (3,9), (4,12), (5,15), (6,18)} 

(c) Consider % = {(x, y): 2x — y = 6, x e R}. Geometrically, each (x, y) e % is 
a point on the graph of the equation 2x - y = 6. Thus, while the choice 

c^a means (a, c) G % rather than (c, a)£^ 

may have appeared strange at the time, it is now seen to be in keeping with the 
idea that any equation y = f(x ) is merely a special type of binary relation. 


PROPERTIES OF BINARY RELATIONS 

A relation % on a set S is called reflexive if a % a for every a e S. 
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Example 2: (a) Let T be the set of all triangles in a plane and % mean “is congruent to”. Now 

any triangle t £ T is congruent to itself; thus, t%t for every (Gf, and % is 
reflexive. 

(6) For the set T let % mean “has twice the area of”. Clearly, t^t and % is not 
reflexive. 

A relation % on a set S is called symmetric if whenever a%b then b%a. 

Example 3: (a) Let P be the set of all persons living on a block of Main Street and % mean 

“has the same surname as”. When x G P has the same surname as jGP, 
then y has the same surname as x; thus, x%y implies y%x and % is symmetric. 

(6) For the same set P, let ‘R mean “is the brother of” and suppose x%y. Now 
y may be the brother or sister of x; thus, x%y does not necessarily imply y%x 
and % is not symmetric. 

A relation % on a set S is called transitive if whenever a%b and b%c then a%c. 

Example 4: (a) Let S be the set of all lines in a plane and % mean “is parallel to”. Clearly, 

if line a is parallel to line 6 and if 6 is parallel to line c, then a is parallel to c 
and % is transitive. 

( b ) For the same set S, let % mean “is perpendicular to”. Now if line a is per¬ 
pendicular to line b and if 6 is perpendicular to line c, then a is parallel to c. 
Thus, % is not transitive. 


EQUIVALENCE RELATIONS 

A relation % on a set S is called an equivalence relation on S when % is (i) reflexive, 
(ii) symmetric, and (iii) transitive. 

Example 5: The relation “=” on the set R is undoubtedly the most familiar equivalence relation. 


Example 6: Is the relation “has the same surname as” on the set P of Example 3 an equivalence 
relation? 

Here we must check the validity of each of the following statements involving 
arbitrary x,y,z&P: 

(i) x has the same surname as x. 

(ii) If x has the same surname as y, then y has the same surname as x. 

(iii) If x has the same surname as y and if y has the same surname as z, then x 
has the same surname as z. 

Since each of these is valid, “has the same surname as” is (i) reflexive, (ii) sym¬ 
metric, (iii) transitive and, hence, is an equivalence relation on P. 


It follows from Example 3 (b) that “is the brother of” is not symmetric and, hence, 
is not an equivalence relation on P. _ „ 

See Problems 1-3. 


EQUIVALENCE SETS 

Let S be a set and % be an equivalence relation on S. If a G S, the elements y G S 
satisfying y%a constitute a subset, [a], of S, called an equivalence set or equivalence class. 
Thus, formally, r 

M = {y-y€S,y%a} 

(Note the use of brackets here to denote equivalence classes.) 


Example 8: Consider the set T of all triangles in a plane and the equivalence relation (see Prob¬ 
lem 1) “is congruent to”. When a,b & T we shall mean by [a] the set or class of 
all triangles of T congruent to the triangle a, and by [6] the set or class of all tri¬ 
angles of T congruent to the triangle b. We note, in passing, that triangle a is 
included in [o] and that if triangle c is included in both [a] and [ft] then [a] and [6] 
are merely two other ways of indicating the class [c]. 
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A set {A, B,C,...} of non-empty subsets of a set S will be called a partition of S 
provided (i) AuBuCu = S and (ii) the intersection of every pair of distinct 
subsets is the empty set. The principal result of this section is 

Theorem I. An equivalence relation % on a set S effects a partition of S and, conversely, 
a partition of <S defines an equivalence relation on S. 

Example 9: Two integers will be said to have the same parity if both are even or both are odd. 

The relation “has the same parity as” on I is an equivalence relation. (Prove this.) 
The relation establishes two subsets of /: 

A = {x : x G /, x is even} and B = {*: x € /, * is odd} 

Now every element of / will be found either in A or in B but never in both. 
Hence, A U B = / and A n B = 0, and the relation effects a partition of I. 

Example 10: Consider the subsets A = {3,6,9,.. .,24}, B = {1,4,7,.. .,26} and C={2,5,8.23} 

of S = {1,2,3,...,26}. Clearly, A U B U C = S and A n B = A n C-= B n C = 0, 
so that { A,B,C } is a partition of S. The equivalence relation which yields this par¬ 
tition is “has the same remainder when divided by 3 as”. 

In proving Theorem I, (see Problem 6), use will be made of the following properties 
of equivalence sets: 

(1) a G [a] 

(2) If b G [a], then [6] = [a]. 

(3) If [a] n [6] + 0, then [a] = [b]. 

The first of these follows immediately from the reflexive property a % a of an equivalence 
relation. For proofs of the others, see Problems 4-5. 

ORDERING IN SETS 

Consider the subset A = {2,1,3,12,4} of N. In writ¬ 
ing this set we have purposely failed to follow a natural 
inclination to give it as A = {1,2,3,4,12} so as to point 
out that the latter version results from the use of the 
binary relation (^) defined on N. This ordering of the 
elements of A (also, of N) is said to be total, since for 
every a,b G A (m,n G N) either a < b, o = b, or a > b 
(m <n, m — n, m> n). On the other hand, the binary 
relation (|), (see Problem 27, Chapter 1) effects only a 
partial ordering on A, i.e., 2 j 4 but 2/3. These order¬ 
ings of A can best be illustrated by means of diagrams. 

Fig. 2-1 shows the ordering of A effected by (^). We 
begin at the lowest point of the diagram and follow the 
arrows to obtain 

1 — 2 — 3 — 4 — 12 

It is to be expected that the diagram for a totally ordered set is always a straight line. 
Fig. 2-2 shows the partial ordering of A effected by the relation (|). 

See also Problem 7. 

A set S will be said to be partially ordered (the possibility of a total ordering is not 
excluded) by a binary relation % if for arbitrary a, b, c G S, 

(i) % is reflexive, i.e., a%a 

(ii') % is anti-symmetric, i.e., a%b and b % a if and only if a = b 
(iii) % is transitive, i.e., a%b and b % c implies a % c. 



Fig. M Fig. 2-2 
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It will be left for the reader to check that these properties are satisfied by each of 
the relations (^) and (|) on A and also to verify the properties contain a redundancy in 
that (ii') implies (i). The redundancy has been introduced to make perfectly clear the 
essential difference between the relations of this and the previous section. 

Let S be a partially ordered set with respect to Then: 

(1) every subset of S is also partially ordered with respect to % while some subsets may 
be totally ordered. For example, in Fig. 2-2 the subset {1,2,3} is partially ordered 
while the subset {1,2,4} is totally ordered by the relation (j). 

(2) the element a G S is called a first element of S if a % x for every x G S. 

(3) the element g G S is called a last element of S if x % g for every x G S. 

(The first (last) element of an ordered set, assuming there is one, is unique.) 

(4) the element a G S is called a minimal element of S if x%a implies x = a for every 
xGS. 

(5) the element g G S is called a maximal element of S if g%x implies g = x for every 
xGS. 

Example 11: (a) In the orderings of A of Fig. 2-1 and 2-2 the first element is 1 and the last ele¬ 

ment is 12. Also, 1 is a minimal element and 12 is a maximal element. 

(6) In Fig. 2-3, S = {«, 6, e, 4} has a first element a but no last element. Here, a is 
a minimal element while e and d are maximal elements. 

(c) In Fig. 2-4, S = {o, b, e, d, e} has a last element e but no first element. Here, 
a and b are minimal elements while e is a maximal element. 



Fig. 2-3 Fig. 2-4 


An ordered set S having the property that each of its non-empty subsets has a first 
element, is said to be well-ordered. For example, consider the sets N and Q each ordered 
by the relation (^). Clearly, N is well-ordered but, since the subset {x: xGQ, x>2} 
of Q has no first element, Q is not well-ordered. Is I well-ordered by the relation (—)? 
Is A = {1,2,3,4,12} well-ordered by the relation (|)? 

Let S be well-ordered by the relation %. Then for arbitrary a,b G S, the subset 
{a,b} of S has a first element and so either a%b or b%a. We have proved 

Theorem II. Every well-ordered set is totally ordered. 

OPERATIONS 

Let Q + = { x: x G Q, x > 0}. For every a,bGQ + , we have 
a + 6, b + a, a* b, b • a, a-s-b, b -5- a G <? + 

Addition, multiplication and division are examples of binary operations on Q + . (Note 
that such operations are simply mappings of Q + xQ + into Q + .) For example, addition 
associates with each pair a,b GQ + an element a + b G Q + . Now a + b = b + a but, in 
general, a -j- b ¥= b + a; hence, to insure a unique image it is necessary to think of these 
operations as defined on ordered pairs of elements. Thus 
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A binary operation “o” on a non-empty set S is a mapping which associates with 
each ordered pair (a, b) of elements of S a uniquely defined element o» b of S. In brief, 
a binary operation on a set S is a mapping of S x S into S. 

Example 12: (a) Addition is a binary operation on the set of even 

natural numbers (the sum of two even natural 
numbers is an even natural number) but is not 
a binary operation on the set of odd natural 
numbers (the sum of two odd natural numbers 
is an even natural number). 

( b) Neither addition nor multiplication are binary 
operations on S = {0,1,2,3,4} since, for ex¬ 
ample, 2 + 3 = 5£S and 2*3 = 6«S. 

(c) The adjoining Table 2-1, defining a certain bi¬ 
nary operation ° on the set A — {a, b, e, d, e} is Table 2-1 

to be read as follows: For every ordered pair 

(x, y) of A X A , we find * ° y as the entry common to the row labeled x and the 
column labeled y. For example, the encircled element is d°e (not e°d). 

The fact that o is a binary operation on a set S is frequently indicated by the equivalent 
statement: The set S is closed with respect to the operation o. Example 12(a) may then 
be expressed: The set of even natural numbers is closed with respect to addition; the set 
of odd natural numbers is not closed with respect to addition. 

TYPES OF BINARY OPERATIONS 

A binary operation o on a set S is called commutative whenever x°y = yox for all 
x,y G S. 

Example 13: (a) Addition and multiplication are commutative binary operations while division 

is not a commutative binary operation on Q + . 

(b) The operation ° on A of Table 2-1 is commutative. This may be checked readily 
by noting that (i) each row (6, c, d, e, a in the second row, for example) and the 
same numbered column (b, c, d, e, a in the second column) read exactly the same 
or that (ii) the elements of S are symmetrically placed with respect to the prin¬ 
cipal diagonal (dotted line) extending from the upper left to the lower right of 
the table. 

A binary operation ° on a set S is called associative whenever (xoy)oz = xo(yoz) 
for all x,y,zG S. 

Example 14: (a) Addition and multiplication are associative binary operations on <?+. 

(b) The operation ° on A of Table 2-1 is associative. We find, for instance, 
(6°c)°d = d°d = b and bo(c°d) = b°a = b; (d°e)°d = c°d = a and 
d ° (e ° d) = d ° c = a; .... Completing the proof here becomes exceedingly 
tedious but it is suggested that the reader check a few other random choices. 

(c) Let ° be a binary operation on R defined by 

a ° b — a + 2b for all a,b e R 
Since (a°6)»« = (o + 26)<>c = a + 26 + 2c 

while a°(b°c) = o°(6 + 2e) = a + 2(6 + 2c) = a + 2b + 4c 

the operation is not associative. 

A set S is said to have an identity ( unit or neutral) element with respect to a binary 
operation o on S if there exists an element u G S with the property u°x = x°u — x for 
every x G S. 

Example 15: (a) An identity element of Q with respect to addition is 0 since 0 + a; = * + 0 = x 

for every * e Q; an identity element of Q with respect to multiplication is 1 

since 1 • x = * • 1 = x for every * 6 Q. 

(6) N has no identity element with respect to addition, but 1 is an identity element 
with respect to multiplication. 
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(c) An identity element of the set A of Example 12(c) with respect to ° is a. Note 
that there is only one. 

In Problem 8, we prove: 

Theorem III. The identity element, if one exists, of a set S with respect to a binary 
operation on S is unique. 

Consider a set S having the identity element u with respect to a binary operation o. 
An element y G S is called an inverse of x G S provided x°y = y °x = u. 

Example 16: (a) The inverse with respect to addition, or additive inverse of x G I is — x since 
x + (—x) = 0, the additive identity element of I. In general, xGI does not 
have a multiplicative inverse. 

(6) In Example 12(c), the inverses of a, b, c, d, e are respectively a, e, d, e, b. 

It is not difficult to prove 

Theorem IV. Let o be a binary operation on a set S. The inverse with respect to o of 
x G S, if one exists, is unique. 

Finally, let S be a set on which two binary operations a and ° are defined. The 
operation a is said to be left distributive with respect to o if 

a o (6 o c) = (a □ 6) o (a a c) for all a,b,c G S (a) 

and is said to be right distributive with respect to ° if 

(hoc) a a = (6 □ a) o (c □ a) for all a,b,c G S (b) 

When both (a) and (b) hold, we say simply that a is distributive with respect to °. Note 
that the right members of (a) and (b) are equal whenever a is commutative. 

Example 17: (a) For the set of all integers, multiplication (<=> = •) is distributive with respect to 

addition (° = +) since x • (y + z) = x • y + x • z for all x,y,z G I. 

(b) For the set of all integers, let ° be ordinary addition and a be defined by 
x<=>y = aa 2 -y = x 2 y for all x,y G I 
Since »°(l + c) = a 2 b + a 2 e = (a ■=> 6) + (a □ c) 

1=1 is left distributive with respect to +. Since 

(Hc)°« = ab 2 + 2abc + ac 2 ** (l°«) + (e°«) = b 2 a + e 2 a 
D is not right distributive with respect to +. 


WELL-DEFINED OPERATIONS 

Let S = {a, b, c, ... } be a set on which a binary operation o is defined and let the 
relation % partition S into a set E = {[a], [6], [c],...} of equivalence classes. Let a 
binary operation © on E be defined by 

[a] © [&] = [ao&] for every [a], [6] G E 

Now it is not immediately clear that, for arbitrary p,q G [a] and r,s G [6], we have 

[per] = [qos] = [oob] (c) 

We shall say that © is well-defined on E, that is, 

fa] © M = [q] © M = [a] © [6] 

if and only if (c) holds. 

Example 18: The relation “has the same remainder when divided by 9 as” partitions N into 
nine equivalence classes [1], [2], [3],..., [9], If ° is interpreted as addition on N, 
it is easy to show that © as defined above is well-defined. For example, when 
x.yGN, 9* + 2G[2] and 9y + 5G[5]; then [2] © [5] = [(9* + 2) + (9y + 5)] = 
[9(* 4- y) + 7] = [7] = [2 + 5], etc. 
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ISOMORPHISMS 

Throughout this section we shall be using two sets: 

A = {1,2, 3, 4} and B = {p, q, r, s} 

Now that ordering relations have been introduced, there will be a tendency to note here 
the familiar ordering used in displaying the elements of each set. We point this out in 
order to warn the reader against giving to any set properties which are not explicitly 
stated. In (1) below we consider A and B as arbitrary sets of four elements each and 
nothing more; for instance, we might have used {*,+,$,%} as A or B; in (2) we introduce 
ordering relations on A and B but not the ones mentioned above; in (3) we define binary 
operations on the unordered sets A and B; in (4) we define binary operations on the 
ordered sets of (2). 

(1) The mapping «: 1 <-» p, q, 3 +±r, 4 s 

is one of twenty-four establishing a 1-1 correspondence between A and B. 


(2) Let A be ordered by the relation % = (|) 
and B be ordered by the relation %' as in¬ 
dicated in the diagram of Fig. 2-5. Since 
the diagram for A is as shown in Fig. 2-6, 
it is clear that the mapping 

p : l<->r, 2 <-> s, 4 p 

is a 1-1 correspondence between A and B 
which preserves the order relations, that 
is, for u,v G A and x,y G B with u**x 
and v <-» y then 

u%' v implies x%y 
and conversely. 



Fig. 2-5 Fig. 2-6 


(3) On the unordered sets A and B, define the respective binary operations ° and a with 
operation tables 

A B 



Table 2-2 


Table 2-3 


It may be readily verified that the mapping 

y : l«s, 2 <-» p, 3 r, A±*q 

is a 1-1 correspondence between A and B which preserves the operations, that is, 
whenever 

w G A +* x G B and v G A «-» y G B 
(to be read “w in A corresponds to x in B and v in A corresponds to y in B”), then 
wov <-> x^y 


(4) On the ordered sets A and B of (2), define the respective binary op erations o and 
with operation tables 
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A 

° | 1 2 3 4 

112 3 4 

2 2 4 1 3 

3 3 14 2 

4 4 3 2 1 


Table 2-4 

It may be readily verified that the mapping 


P r s p q 
q 8 p q r 
r p q r s 
s q r s p 

Table 2-5 


P : 1 *-*r, 2 s, 3 q, 4 p 

is a 1-1 correspondence between A and B which preserves both the order relations 
and the operations. 


By an algebraic system S we shall mean a set S together with any relations and 
operations defined on S. In each of the cases (l)-(4) above, we are concerned then with a 
certain correspondence between the sets of the two systems. In each case we shall say 
that the particular mapping is an isomorphism of A onto B or that the systems A and B 
are isomorphic under the mapping in accordance with: 

Two systems 5 and T are called isomorphic provided 

(i) there exists a 1-1 correspondence between the sets S and T, and 

(ii) any relations and operations defined on the sets are preserved in the correspondence. 
Let us now look more closely at, say, the two systems A and B of (3). The binary 

operation o ia both associative and commutative; also, with respect to this operation, 
A has 1 as identity element and every element of A has an inverse. One might suspect 
then that Table 2-2 is something more than the vacuous exercise of constructing a square 
array in which no element occurs twice in the same row or column. Considering the 
elements of A as digits rather than abstract symbols, it is easy to verify that the binary 
operation ° may be defined as: for every x,y G A, xoy is the remainder when x-y is 
divided by 5. (For example, 2-4 = 8 = 1*5 + 3 and 2°4 = 3.) Moreover, the system B 
is merely a disguised or coded version of A, the particular code being the 1-1 correspond¬ 
ence y. 

We shall make use of isomorphisms between algebraic systems in two ways: 

(а) having discovered certain properties of one system (for example, those of A listed 
above) we may without further ado restate them as properties of any other system 
isomorphic with it. 

(б) whenever more convenient, we may replace one system by any other isomorphic with 
it. Examples of this will be met with in Chapters 4 and 6. 


PERMUTATIONS 

Let S = {1,2,3,...,«} and consider the set S n of the n! permutations of these n 
symbols. (No significance is to be given the fact that they are natural numbers.) The 
definition of the product of mappings in Chapter 1 leads naturally to the definition of a 
“permutation operation” ° on the elements of S n . First, however, we shall introduce 
more useful notations for permutations. 

Let it,Ii, i 3 , in be some arrangement of the elements of S. We now introduce a 
two-line notation for the permutation 
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which is simply a variation of the notation for the mapping 

a : la = it, 2 a — ii, 3 a — iz, . . ., Via — in 
Similarly, if ■ ■ - ,U is another arrangement of the elements of 

fl 2 3 ... 

/* = (... 

\Jl H 33 ... In, 

By the product aojjwe shall mean that a and /9 are to be performed in that order. Now 
a rearrangement of any arrangement of the elements of S is simply another arrangement 
of these elements. Thus, for every a,peS n , a o p G S n and ° is a binary operation on S n . 


'-(!! 


and y =4 


Example 19: Let 

three of the 5! permutations in the set S 5 of all permutations 
Since the order of the columns of any permutation is 


write fl as = in which the upper line of fl 

Then \3 2 5 4 1/ 

oa _ /l 2 3 4 5\ 0 /2 3 4 5 l\ _ /1| 
“ P \2 3 4 5 1/ \3 2 5 4 lj 1^3 

Similarly, rewriting aas(J®^®M,we And B ° a = 

° is not commutative. \2 4 3 1 5/ 

Writing y as (* \ ® M , we find 
\4 2 3 5 1/ 

( a o P )oy = (1 2 3 4 5\ „ (s 2 5 4 l\ = 

' 7 \3 2 5 4 1/ \4 2 3 5 


S = (1,2, 3,4, 5}. 
material, we may re- 
the lower line of a. 


v-G; 


It is left for the reader to obtain B 


The identity permutation is J ^ ^ | 

3° a = = «, ••• 

Finally, interchanging the two lines of a, we have 

l 5 1\ _ fl 2 3 4 5\ 
14 5/ “ \5 1 2 3 4/ 


/2 3 4 1 
\1 2 3 < 


since a°a 1 = a 1 ° a = J}. Moreover, it is evident that ev 
an inverse. 

Another notation for permutations will now be introduced. The 

= f 1 2 3 4 5N 

\2 3 4 5 

of Example 19 can be written in cyclic notation as (12345) where the 
interpreted to mean: 1 is replaced by 2, 2 is replaced by 3, 3 by 4, 4 
The permutation . . 

= 1 2 3 4 5 

V 1 2 4 5 3/ 

can be written as (345) where the cycle (345) is interpreted to mean: 1 dnd 2, the missing 


be 


I 4 5\ 
i 4 l/ 


2 3 4 5) 
4 3 15/ 


f l 2 3 4 5\ 
v 4 2 3 5 l/ 


ifiow that (a 0 B) ° 7 = 
isy to show that ° is 


•y element of S 5 has 
permutation 


cycle (12345) is 
iby 5, and 5 by 1. 



24 


RELATIONS AND OPERATIONS 


[CHAP. 2 


symbols, are unchanged while 3 is replaced by 4, 4 by 5, and 5 by 3. The permutation /? 
can be written as (23)(45). The interpretation is clear: 1 is unchanged; 2 is replaced by 3 
and 3 by 2; 4 is replaced by 5 and 5 by 4. We shall call (23)(45) a product of cycles. Note 
that these cycles are disjoint, i.e., have no symbols in common. Thus, in cyclic notation 
we shall expect a permutation on n symbols to consist of a single cycle or the product of 
two or more mutually disjoint cycles. Now (23) and (45) are themselves permutations of 
S = (1,2,3,4,5} and, hence, /3 = (23)o(45) but we shall continue to use juxtaposition to 
indicate the product of disjoint cycles. The reader will check that a °fi = (135) and 
/3° a — (124). In this notation the identity permutation 3 will be denoted by (1). 

See Problem 11. 


TRANSPOSITIONS 

A permutation such as (12), (25), . .. which involves interchanges of only two of the 
n symbols of S = {1,2,3, ...,«} is called a transposition. Any permutation can be 
expressed, but not uniquely, as a product of transpositions. 

Example 20: Express each of the following permutations 

(a) (23), ( b) (135), (c) (2345), (d) (12345) 

on S = {1,2,3,4,5) as products of transpositions. 

(o) (23) = (12)M23 )o( 13) = (12)o(13)o(l2) 

(6) (135) = (13)°(15) = (15)o(35) = (15) ° (13) o (15) o (13) 

(c) (2345) = (23)o(24)o(25) = (25) ° (34) o (35) 

(d) (12345) = (12)o(13)o(14)o(15) 


The example above illustrates 


Theorem V. Let a permutation a on n symbols be expressed as the product of r trans¬ 
positions and also as a product of s transpositions. Then r and s are either 
both even or both odd. For a pr00f ^ Problem 12 


A permutation will be called even (odd) if it can be expressed as a product of an even 
(odd) number of transpositions. In Problem 13, we prove 

Theorem VI. Of the n\ permutations of n symbols, half are even and half are odd. 
Example 20 also illustrates 

Theorem VII. A cycle of m symbols can be written as a product of m-1 transpositions. 


ALGEBRAIC SYSTEMS 

Much of the remainder of this book will be devoted to the study of various algebraic 
systems. Such systems may be studied in either of two ways: 

(a) we begin with a set of elements (for example, the natural numbers or a set isomorphic 
to it), define the binary operations addition and multiplication, and derive the familiar 
laws governing operations with these numbers. 

(b) we begin with a set S of elements (not identified); define a binary operation o; lay 
down certain postulates, for example, (i) ° is associative, (ii) there exists in S an 
identity element with respect to o, (iii) there exists in S an inverse with respect to ° 
of each element of S'; and establish a number of theorems which follow. 

We shall use both procedures here. In the next chapter we shall follow (a) in the study 
of the natural numbers. 
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Solved Problems 

1. Show that “is congruent to” on the set T of all triangles in a plane is an equivalence 
relation. 

(i) “a is congruent to a for all a G T” is valid. 

(ii) “If a is congruent to b, then b is congruent to a” is valid. 

(iii) “If a is congruent to b and b is congruent to c, then a is congruent to c’ is valid. 

Thus “is congruent to” is an equivalence relation on T. 

2. Show that “ < ” on / is not an equivalence relation. 

(i) “a < a for all a G /” is not valid. 

(ii) “If a < b, then b < a” is not valid. • 

(iii) “If a < b and b < c, then a < c” is valid. 

Thus “<” on I is not an equivalence relation. (Note that (i) or (ii) is sufficient.) 

3. Let % be an equivalence relation and assume c%a and c%b. Prove a%b. 

Since c%a, then a%c (by the symmetric property). Since a%c and c%b then a%b (by the 
transitive property). 


4. Prove: If 6 e [a], then [6] = [a]. 

Denote by % the equivalence relation defining [a]. By definition, b G [a] implies b % a and 
x G [6] implies x%b. Then x%a for every x G [6] (by the transitive property) and [6] C [<*]. 
A repetition of the argument using a%b (which follows by the symmetric property of %) and y%a 
(whenever y G [a]) yields [a] c [6], Thus [6] = [a], as required. 


5. Prove: If [a] n [6] v 4 0, then [a] = [6]. 

Suppose [a] n [6] = (r,s, ...}. Then [r] = [a] and [r] = [6] (by Problem 4), and [a] = [6] 
(by the transitive property of =). 


6. Prove: An equivalence relation 1( on a set S effects a partition of S and, conversely, 
a partition of S defines an equivalence relation on S. 

Let % be an equivalence relation on S and define for each pGS, 

T P = [P] = {x.x&S,x%p} 

Since p G [p], it is clear that S is the union of all the distinct subsets T a , T b , T,, . . . induced by %. 
Now for any pair of these subsets, as T b and T c , we have T b n T c = 0 since, otherwise, T b = T c 

by Problem 5. Thus, {T a , T b , T c , .. .} is the partition of S effected by %. 

Conversely, let {T a ,T b ,T c , ...} be any partition of S. On S define the binary relation % by 
p%q if and only if there is a in the partition such that p, q G T f . It is clear that % is both 
reflexive and symmetric. Suppose p%q and q%r\ then by the definition of H there exist subsets 
Tj and T k (not necessarily distinct) for which p, q G T } and q, rG T k . Now T, n T k ¥= 0 and 

so Tj = T k . Since p,rG T jy then p%r and % is transitive. This completes the proof that % is an 

equivalence relation. 
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7. Diagram the partial ordering of (a) the set of subsets of S = {a, b, c} effected by the 
binary relation (C), ( b ) the set B = {2,4,5,8,15,45,60} effected by the binary relation (|). 




Fig. 2-8 


These figures need no elaboration once it is understood a minimum number of line segments is to 
be used. In Fig. 2-7, for example, 0 is not joined directly to {a, 6, e} since 0 £ (a, b, e } is indicated 
by the path 0 Q {a} C {a, b} C {a, 6, c}. Likewise, in Fig. 2-8, segments joining 2 to 8 and 5 to 45 
are unnecessary. 


8. Prove: The identity element, if one exists, with respect to a binary operation o on a 
set S is unique. 

Assume the contrary, that is, assume g, and g 2 to be identity elements of S. Since is an 
identity element we have g t ° g 2 — 9 2 , while since g 2 is an identity element we have g t ° g 2 = q t . 
Thus gi = gi 0 g 2 = g 2 ; the identity element is unique. 


9. Show that multiplication is a binary operation on S = {1,-1, i,—i) where i = y/-l- 


This can be done most easily by forming the adjoining table and 
noting that each entry is a unique element of S. 

The order in which the elements of S are placed as labels on the 
rows and columns is immaterial; there is some gain, however, in using 
the same order for both. 

The reader may show easily that multiplication on S is both asso¬ 
ciative and commutative, that 1 is the identity element and that the 
inverses of 1 , — 1 , i, —t are respectively 1 , — 1 , —i, i. 



Table 2-6 


10. Determine the properties of the binary operations o and a defined on S - {a, b, c, d} 
by the given tables: 


6 6 c d 


c e d a 6 

d d a b c 


<=> a b c d 
a d a c b 
b a c b d 
c b d a e 
d e b d a 


Table 2-7 


Table 2-8 



CHAP. 2] 


RELATIONS AND OPERATIONS 


27 


The binary operation ° defined by Table 2-7 is commutative (check that the entries are sym¬ 
metrically placed with respect to the principal diagonal) and associative (again, a chore). There is an 
identity element a (the column labels are also the elements of the first column and the row labels are 
also the elements of the first row). The inverses of a, b, e, d are respectively a,d,e,b (a°a= b°d = 
e °c = dob = a). 

The binary operation a defined by Table 2-8 is neither commutative (a a e ■■= e, e <=> a = 6) nor 

associative (a n (6 □ e) = a a 6 = a, (a □ 6) a c = a a e = e). There is no identity element and, hence, 

no element of S has an inverse. 

Since a a (doc) — a * d = (a a d) ° (o a c) and (doc) a a ¥■ (d a a) “(c a a), a is neither left 
nor right distributive with respect to °; since d°(c<=>6) = c ¥* a = (d° c) <=> (d° b) and ° is commu¬ 
tative, ° is neither left nor right distributive with respect to <=>. 

11. (a) Write the permutations (23) and (13)(245) on 5 symbols in two line notation. 

( b ) Express the products (23)°(13)(245) and (13)(245) °(23) in cyclic notation. 

(c) Express in cyclic notation the inverses of (23) and of (13)(245). 

<«> (23) = (1 s 2 i s) *” d < 13 >< 245 > = (j 1 ! 5 ') 

(6) (23) ° (13)(245) = () J | 1 {) * (l ! “ {) = (1 J ! i !) = ( 13 «® 

“ d (i3,(245) ° (23) - (i:!*,;)• (sii!!)-(;;;*;)»<««., 

(') T >» <«~» «' (23) <• (I 2 3 1 S) = (1 3 2 1 l) = |2S) - 

The inverse of (1S)(245) is (? 2 3 4 5) = (s 5 1 2 l) = (13)1254). 

12. Prove: Let a permutation a on n symbols be expressed as the product oJ: r transpositions 
and also as the product of s > r transpositions. Then r and s are either both even or 
both odd. 

Using the distinct symbols x lt x 2 ,x s . x n , we form the product 

A = (*! - x 2 )(x t - X 3 )(*, - * 4 ) • • • (*! - x n ) 

(x 2 - x 3 )(x 2 -x t ) ■■■ (x 2 - x n ) 


(»„_!-X B ) 

A transposition (u, v), where u < v, on A has the following effect: (1) any factor which involves 
neither x„ nor x v is unchanged, (2) the single factor x u — x„ changes sign, (3) th« remaining factors, 
each of which contains either x u or x v but not both, can be grouped into pairs, (x„ — x w )(x v — x w ) 
where u<v<w, (x u — x„)(x w — x v ) where u<w<v, and (x„ — x u )(x„ — x„) wherj w<u<v, which 
are all unchanged. Thus, the effect of the transposition on A is to change its sign. 

Now the effect of a on A is to produce (—l) r A or (—1)»A according as a is written as the product 
of r or s transpositions. Since (—l) r A = (—1)*A, we have A = (—1)*-»A so that t — r is even. Thus, 
r and s are either both even or both odd. 


13. Prove: Of the n! permutations on n symbols, half are even and half are odd. 

Denote the even permutations by Pi,p 2l Pa, ■ ■ - ,P U and the odd permutations ay <h, q 2 >Qa> ■ ■ - ><lv 
Let t be any transposition. Now t°Pi, t°p 2 , t°p 3 , ..., t °p u are permutations cn n symbols. They 
are distinct since p lf p 2 , p 3 , ..., p„ are distinct and they are odd; thus, u — v. Also, t ° q u t ° q 2 , 
t°q 3 . ..., t°q v are distinct and even; thus, v — u. Hence u = v = •£»! 
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Supplementary Problems 

14. Which of the following are equivalence relations? 

(а) “Is similar to” for the set T of all triangles in a plane. 

(б) “Has the same radius as” for the set of all circles in a plane. 

(c) “Is the square of” for the set N. 

(d) “Has the same number of vertices as” for the set of all polygons in a plane. 

(e) “Q” for the set of sets S = {A,B, C, . . .}. 
if) for the set R. 

Ans. (a), (6), (d ). 

15. (a) Show that “is a factor of” on N is reflexive and transitive but is not symmetric. 

(6) Show that “costs within one dollar of” for men’s shoes is reflexive and symmetric but not 
transitive. 

(c) Give an example of a relation which is symmetric and transitive but not reflexive. 

(d) Conclude from (a), (6), (c) that no two of the properties reflexive, symmetric, transitive of a 
binary relation implies the other. 

16. Diagram the partial ordering of 

(а) A = {1,2,3,6} 

(б) B = {1,2,3,5,30,60} and (c) C = {1,3,5,15,30,45} 
effected on each by the relation (|). 

17. Let S = {a,b,e,d,e,f} be ordered by the relation % as shown in 
Fig. 2-9. (a) List all pairs x,y € S for which x%y. ( b ) List all sub¬ 
sets of three elements each of which are totally ordered. 

18. Verify: 

(a) the ordered set of subsets of S in Problem 7(a) has 0 as first element (also, as minimal element) 
and 5 as last element (also, as maximal element). 

(b) the ordered set B of Problem 7(5) has neither a first nor last element. What are its minimal and 
maximal elements? 

(c) the subset C = {2,4,5,15,60} of B of Problem 7(5) has a last element but no first element. 
What are its minimal and maximal elements? 



19. Show: 

(а) multiplication is a binary operation on S = {1,-1} but not on T = {1,2}, 

(б) addition is a binary relation on S = {* :*€/,*< 0} but multiplication is not. 

20. Let S = {A,B, C, D} where A = 0, B — {a, 6}, C = {a, c}, D = {a, 5, c}. Construct tables to show 
that U is a binary relation on S but n is not. 


21. For the binary relations ° and a defined on S = {a, 6, e, d, e} by Tables 2-9 and 2-10, assume 
associativity and investigate for all other properties. 


5 d 

d d 


5 c 

~d 7 

5 5 

5 c 

d d 



Table 2-9 


O I a 5 c d e 



22. Let S = { A,B,C,D} where A = 0, B = {a}, C = {a, 5}, D = {a,6,c}. 

(а) Construct tables to show that U and n are binary operations on S. 

(б) Assume associativity for each operation and investigate all other properties. 
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23. For the binary operation on S = {o, b, e, d, e, f, g, h,} 
defined by Table 2-11, assume associativity and in¬ 
vestigate all other properties. 


24. Show that ° defined in Problem 23 is a binary opera¬ 
tion on the subsets S 0 = (a), S 1 = {a, c}, S 2 = {a, e), 
S 3 = {a, /}, S 4 = {a,g}, S 5 = {a,h}, S 6 = {a,b,c,d}, 
S 7 = {a, e, e, /}, S 8 = {a, c, g, h} but not on the sub¬ 
sets T x = {a, 6} and T 2 = (a, f, g} of S. 


25. Prove Theorem IV. Hint-. Assume y and z to be 
inverses of x and consider z°(x°y) = ( z°x)°y. 


b b 

d d 


b 

~b 

d 


e e g 

f f h 

9 9 f 

h h e 


e d e 

~e d e 

d a h 

a b f 

beg 
f h a 

e 9 e 

h e d 

9 f b 

Table 2-11 


/ 

T 

9 

h 


b 

d 


9 

9 

h 

f 

b 

d 


h 

~h 

f 

9 

d 

b 


26. (a) Show that the set N of all natural numbers under addition and the set M = {2x : xG N} under 

addition are isomorphic. Hint: Use nGN 2nGM. 

(b) Is the set N under addition isomorphic to the set P = {2x — 1: x G N) under addition? 

(c) Is the set M of (a) isomorphic to the set P of (6)? 

27. Let A and B be sets with respective operations ° and D • Suppose A and B are isomorphic and show: 

(а) if the associative (commutative) law holds in A it also holds in B. 

(б) if A has an identity element u, then its correspondent u' is the identity element in B. 

(e) if each element in A has an inverse with respect to °, the same is true of the elements of B with 
respect to a . 

Hint: In (a), let aGA*+a'GB, b +* b', c Then 

a o (b ° c) a' o (b' a e ’), (a°b)°c ** (o' ■=■ 6') ° c' 

and o o (b ° c) = (a°b)°c implies o' 1=1 (&' 1=1 c') = ( a’ = 6') a c' 


28. Express each of the following permutations on 8 symbols as a product of disjoint cycles and as a 
product of transpositions (minimum number). 


1 2 3 4 5 6 7 8\ 
23415678 / 


(6) ( 


1234567 8\ 
34567812/ 


(«) 


/I 2 3 4 5 6 7 
4 1 6 8 2 7 6/ 


(d) (2468)o(348) (e) (15)(2468) ° (37)(16468) (/) (135) ° (3456) o (4678) 


Partial answer, (o) (1234) = (12)(13)(14) 

(e) (13)(246)(58) = (13)(24)(26)(58) 

(d) (28)(346) = (28)(34)(36) 

(/) (1637845) = (16)(13)(17)(18)(14)(15) 


Note: For convenience, ° has been suppressed in indicating the products of transpositions. 


29. Show that the cycles (1357) and (2468) of Problem 28(5) are commutative. State the theorem 
covering this. 


30. Write in cyclic notation the 6 permutations on S = (1,2,3), denote them in some order by 
p lt p 2 ,P3, ••■,Pe> and form a table of products Pj°p 3 -. 


31. Form the operation (product) table for the set S = {(1), (1234), (1432), (13)(24)} of permutations 
four symbols. Using Table 2-3, show that S is isomorphic to B. 




Chapter 3 


The Natural Numbers 

THE PEANO POSTULATES 

Thus far we have assumed those properties of the number systems necessary to 
provide examples and exercises in the earlier chapters. In this chapter we propose to 
develop the system of natural numbers assuming only a few of its simpler properties. 
These simple properties, known as the Peano Postulates (Axioms) after the Italian 
mathematician who in 1899 inaugurated the program, may be stated as follows: 

Let there exist a non-empty set N such that 
Postulate 1: 1 G N 

Postulate II : For each « € N there exists a unique to* G N, called the successor of n. 
Postulate III : For each n G N we have n* ¥> 1. 

Postulate IV : If m,n G N and m* = to*, then m = ». 

Postulate V : Any subset K of N having the properties 

(a) lex 

(b) k* GK whenever k G K 

is equal to N. 

First, we shall check to see that these are in fact well-known properties of the natural 
numbers. Postulates I and II need no elaboration; III states that there is a first natural 
number 1; IV states that distinct natural numbers m and n have distinct successors m +1 
and n + 1; V states essentially that any natural number can be reached by beginning with 1 
and counting consecutive successors. 

It will be noted that, in the definitions of addition and multiplication on N which 
follow, nothing beyond these postulates is used. 

ADDITION ON N 

Addition on N is defined by 

(i) to +1 = to*, for every to G N 

(ii) to + to* = (to + m)* whenever to + m is defined. 

It can be shown that addition is then subject to the following laws: 

For all m,n,p G N, 

Ai. Closure Law n + mGN 

A 2 . Commutative Law n + m = m + n 

Aa. Associative Law m + (n + p) — (m + n) + p 

Ai. Cancellation Law If m + p — n + p, then m — n. 

30 
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MULTIPLICATION ON N 

Multiplication on N is defined by 

(iii) n • 1 = n 

(iv) n • m* = n-m + n whenever «• m is defined. 

It can be shown that multiplication is then subject to the following laws: 
For all m,n,p G N, 

Mi. Closure Law n'mGN 

M 2 . Commutative Law m-n = n-m 

M 3 . Associative Law m • (n • p) = (w n) • p 

Mi. Cancellation Law If m~p = n-p, then m = n. 

Addition and multiplication are subject to the Distributive Laws: 

For all m,n,p G N, 

Di. m-(n + p) = wn + rri'p 
D 2 . (n + p) • m = n • m + p • m 


MATHEMATICAL INDUCTION 
Consider the proposition 

P(m): 


6 m, for every m G N 


We shall now show how this proposition may be established using only the Postulates 
I-V. Define 

K = {k : k GN, P(k) is true} 


Now 

and 

Thus 


1 GN 
1 * + 1 

P(l) is true and 1 G K 


(Postulate I) 
(Postulate III) 


Next, let k be any element of K; then 

(a) P{k): k* + k 


is true. 
Hence 


Now if (k*)* = k* it follows by Postulate IV that k* = k, a contradiction of (a). 
P(k*): (k*)* # k* 


is true and so k* G N. Now K has the two properties stated in Postulate V; thus, K — N 
and the proposition is valid for every mGN. 


In establishing the validity of the above proposition, we have at the same time 
established the following 

Principle of Mathematical Induction 

A proposition P(m) is true for all mGN provided: 

P(l) is true 


and, for each k GN, P{k) is true implies P{k*) is true 


The several laws A 1 -A 4 , M 1 -M 4 , D 1 -D 2 can be established by mathematical induction. 
Ai is established in Example 1, A 3 in Problem 1, A 2 in Problems 2 and 3, and D 2 in 
Problem 5. 
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Example 1: Prove the Closure Law: n + mGN for all m,nGN. 

We are to prove that n + m is defined (is a natural number) by (i) and (ii) 
for all m,n G N. Suppose n to be some fixed natural number and consider the 
proposition 

P(m ): n + mGN, for every mGN 


is true since n + 1 = n* (by (i)) and n* G N (by Postulate II). Suppose next that 
for some kGN 


(a) 

It then follows that 


P(k): n + kGN is true 

P(k*): n + k*GN 


is true since n + k* — (n + k)* (by (ii)) and (n + fc)* G N whenever n + kGN 
(by Postulate II). Thus, by induction, P(m) is true for all mGN and, since n was 
any natural number, the Closure Law for Addition is established. 

In view of the Closure Laws Ai and Mi, addition and multiplication are (see Chapter 2) 
binary operations on N. The laws A 3 and M 3 suggest as definitions for the sum and 
product of three elements a lf a 2 ,03 G N, 

(v) ®1 + 0,2 + 0,3 = (<ll + <*2) + ®3 = Ol + (®2 + U3) 

and 

(vi) 


ai*a2*a$ = (ai • a 2 ) • <13 = ai • (a 2 • 0 - 3 ) 


Note that for a sum or product of three natural numbers, parentheses may be inserted 
at will. The sum of four natural numbers is considered in Problem 4. The general case 
is left as an exercise. 

In Problem 6, we prove 

Theorem I. Every element n# 1 of N is the successor of some other element of N. 


THE ORDER RELATIONS 

For each m,n GN, we define “<” by 

(vii) m < n if and only if there exists some p € N such that m + p = n. 

In Problem 8 it is shown that the relation < is transitive but neither reflexive nor 

symmetric. By Theorem I, , 

1 < n for all n # 1 

and, by (i) and (vii), n<n* for all n GN 

For each m,n G N, we define “>” by 

(viii) m> n if and only if n <m 

There follows 

The Trichotomy Law: For any m,n G N one and only one of the following 
(a) m = n, (6) m <n, (c) m> n 

is true. For a proof, see Problem 10. 

Further consequences of the order relations are Theorems II and II': 

Theorem II. If m,nGN and m<n, then for each p G N, 

(a) m + p < n + p (b) m-p < n-p 
and, conversely, if (a) or (6) hold for some p G N, then m <n. 
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Theorem II'. If m,nGN and m > n, then for each p G N, 

(а) m + p > n + p 

(б) m'p > n-p 

and, conversely, if (a) and (6) hold for some j)£JV, then m>n. 

Since Theorem II' is merely Theorem II with m and n interchanged, it is clear that 
the proof of any part of Theorem II (see Problem 11) establishes the corresponding part 
of Theorem II'. 

The relations “less than or equal to” (—) and “equal to or greater than” (—) are 
defined as follows: 

For m,n 6JV, m-» if either m<n or m = n 

m — n if either m — n or m> n 

holds. 

Let A be any subset of N (i.e., A C N). An element p of A is called the least element 
of A provided p ^ a for every a G A. Notice that in the language of sets, p is the first 
element of A with respect to the ordering ^. In Problem 12, we prove 
Theorem III. The set N is well-ordered. 

MULTIPLES AND POWERS 

Let a G S, on which binary operations + and • have been defined, and define 
la = a a 1 = a 

and (fc + l)a = Jca + a a k+1 = a k * a 

whenever ka and a k , for k G N, are defined. 

Example 2: Since la = a and a 1 = a, we have 

2 a = (1 + l)a = la + la = a + a and a 2 = a 1 + 1 = a 1 * a = a • a 

3a = (2 + l)a = 2a + la = a + a + a and a 3 = a 2+1 = a 2 • a = a * a * a 

etc. 

It must be understood here that in Example 2 the + in 1 + 1 and the + in a + a 
are to be presumed quite different, since the first denotes addition on N and the second 
on S'. (It might be helpful to denote the operations on S by © and O). In particular, 
k = a + a+ ••• + a is a multiple of a, and can be written as k • a only if k G S. 

Using the induction principle, the following properties may be established for all 
a,b G S and all m, n G N: 

(ix) ma + na = (m + n)a (ix)' a m *a n = a m+B 

(x) m(na) = (m • n)a (x)' (a") m - a m n 

and, when + and • are commutative on S, 

(xi) na + nb = n(a + b) (xi)' a n -b n = (ab) n 

ISOMORPHIC SETS 

It should be evident by now that the set {1,1*,(1*)*, ...}, together with the opera¬ 
tions and relations defined on it developed here differs from the familiar set (1,2,3, ...} 
with operations and relations in vogue at the present time only in the symbols used. 
Had a Roman written this chapter he would, of course, have reached the same conclusion 
with his system (I,II,III, ...}. We say simply that the three are isomorphic. 
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Solved Problems 

1. Prove the Associative Law A 3 : m + (n + p) — (m + n) + p for all m,n,p G N. 

Let m and n be fixed natural numbers and consider the proposition 

P(p) : m+(n + p) = (m + n) + p for all pG N 

We first check the validity of P(l): m + (it + 1) = (m + n) + 1. By (i) and (ii), Page 30, 
m + («+1) = m + n* = (m + n)* = (m + n) + 1 

and P(l) is true. 

Next, suppose that for some k G N, 

P(k): m+ (n + k) = (m + n) + k 

is true. We need to show that this assures 

P(k *): m + (n + k*) = (m + n) + k* 

is true. By (ii), m + (n + k*) = m + (n + k)* = [m + (w + fc)]* 

and (m + n) + k* = [(m+ «) + *]* 

Then, whenever P(k) is true, 

m + (n + k)* = [m + (»+fc)]* = [(m+ «) + &]* = (m + n) + k* 
and P(k*) is true. Thus, P(p) is true for all p € N and, since m and n were any natural numbers, 
A 3 follows. 


2. Prove P(n): n + l = l + » for all nGN. 

Clearly P(l): 1 + 1 = 1 + 1 is true. Next, suppose that for some k& N, 

P(k): k + 1 = 1 + k 

is true. We are to show that this assures 

P(k*): k* + 1 = 1 + k* 

is true. Using in turn the definition of k*, A 3 , the assumption that P(k) is true, and the definition 
of k*. we have ^ + ^ _ 1 + (k + 1) _ (i + fc) + i = (fc+ 1) + 1 = fc* + 1 

Thus P(k*) is true and P(n) is established. 


3. Prove the Commutative Law A 2 : m + n = n + m for all m,n G N. 

Let n be a fixed but arbitrary natural number and consider 

P(m) : m + n = n + m for all m G N 

By Problem 2, P(l) is true. Suppose that for some k G N, 

P(k) : k + n = n + k 

is true. Now 

k* + n = (k+1) + n = fc + (l + n) = k+(n+1 ) = k + n* = (k + n)* = (n + k)* = n + k* 
Thus P(k*): k* + n = n + k* 

is true and A 2 follows. 

The reader will check carefully that in obtaining the sequence of equalities above, only definitions, 
postulates, such laws of addition as have been proved and, of course, the critical assumption P(k) is 
true for some arbitrary k G N have been used. Both here and in later proofs, it will be understood 
that when the evidence supporting a step in a proof is not cited, the reader is expected to supply it. 
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4. (a) Let ai, a 2 , a 3 , a 4 GN and define a,i + a 2 + a 3 + a* = (ai + a 2 + a 3 ) + a 4 . Show that 

in a t + a 2 + a 3 + a 4 we may insert parentheses at will. 

Using (v), we have <*i + a 2 + «s + = (<*i + o 2 + a 3 ) + a 4 = (a l + n 2 ) + a 3 + a 4 = (a t + a 2 ) 4 - 

(a 3 + o 4 ) = aj + a 2 + (o 3 + a 4 ) - a, 4 - (o 2 4- a 3 + a 4 ), etc. 

(b) For b,ai,a 2 ,a 3 G N, prove b • (ai 4-a 2 4-a 3 ) = b*ai 4- b*a 2 4- b*a 3 . 

b • (aj + a 2 + a 3 ) = 6 • [(«! + a 2 ) + a 3 ] = 6 • (o t + a 2 ) + 6*a 3 = 6 *Oj + 6’o 2 + 6*o 3 

5. Prove the Distributive Law D 2 : (n + p) • m = n • m + p • m for all m,n,p € N. 

Let n and p be fixed and consider 

P(m) : (» + p)*m = n-m + p-m for all m G N 

Using Aj and (iii), we find 

P(l): (w + p)*l = w + p = «-l + p-l 

is true. Suppose that for some fc € N, 

P(k): (» + p) • fc = n-fc + p-fc 

is true. Then (n + p) • fc* = (» + p) • k + (» + p) = n • k + p • k + w + p 

= n*fc + (p*fe + «) + p = wfc+(« + p-fc) + p 
= (n • fc + n) + (p • A: + p) = w • fc* + p • fc* 

Thus, P(fc*): (n + p)*fc* = wfc* + p*fc* 

is true and D 2 is established. 

6. Prove: Every element n * 1 of N is the successor of some other element of N. 

First, we note that Postulate III excludes 1 as a successor. Denote by K the set consisting of 
the element 1 and all elements of N which are successors, i.e., 

K = {fc : fc € N, k = 1 or fc = m* for some m e N) 

Now every fc € K has a unique successor fc* G N (Postulate II) and, since fc* is a successor, we have 
fc* £ K. Then K = N (Postulate V). Hence, for any n&N we have either » = 1 or n = m* for 
some m G IV. 


7. Prove: m + n¥=m for all m,nGN. 

Let n be fixed and consider P(m): m + n¥=m for all m € N. By Postulate III, P(l): 1 + n? 4 1 
is true. Suppose that for some fc £ AT, 

(a) P(fc): fc + w *= fc 


is true. Now (fc + n )* 
of (a). Thus 


fc* since, by Postulate IV, (fc + n)* = fc* implies k + n — k, 
P(fc*): fc* + « ^ fc* 


is true and the theorem is established. 


a contradiction 


8. Show that < is transitive but neither reflexive nor symmetric. 

Let m, n, p £ IV and suppose that m < n and n < p. By (vii) there exist r, s £ N such that 
m + r = n and n + * = p. Then 

n + s = (m + r) + * = m + (r +«) = p 
Thus, m < p and < is transitive. 
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Let n G JV. Now n < n is false since, if it were true, there would exist some k G JV such that 
n + k = n, contrary to the result in Problem 7. Thus, < is not reflexive. 

Finally, let to, n G JV and suppose to < n and n < m. Since < is transitive, it follows that 
m < m, contrary to the result in the paragraph immediately above. Thus, < is not symmetric. 

9. Prove: 1 — n, for every n&N. 

When n = 1 the equality holds; otherwise, by Problem 6, n = m* = m+ 1, for some to G JV, 
and the inequality holds. 

10. Prove the Trichotomy Law: For any m,n G N, one and only one of 

(a) m — n (6) m <n (c) m> n 

is true. 

Let to be any element of N and construct the subsets 

Nj = {to}, JV 2 = {x: x G JV, x < to}, N 3 = {x: x G JV, x > to} 

We are to show that {JV lf JV 2 , JV 3 } is a partition of N relative to {=, <, >). 

(1) Suppose m = 1; then N 1 = {1}, JV 2 = 0 (Problem 9) and JV 8 = {»:*€ JV, x > 1}. Clearly 

IVj U JV 2 U N a — JV. Thus, to complete the proof for this case, there remains only to check 

N, n N 2 = Nj n N a = N 2 n N a = 0. 

(2) Suppose m* 1. Since 1 e N 2 , it follows that 1 6 U N 2 U N a . Now select any n# 16 
(Vj U N 2 U N a . There are three cases to be considered; 

(i) n e #!• Here, n = m and so n* € N a . 

(ii) n€N 2 so that n + p - m for some p € N. If p = 1, then n*=m,€N 1 ; if p * 1 so 

that p = l + g for some q€N, then n* + q = m and so n* € N 2 . 

(iii) w € N a . Here n* > n> m and so n* € N a . 

Thus, for every nSN, 

n € Nj U N 2 U N a implies n* € Nj U N 2 U N a 
Since 1 S U N 2 U N a we conclude that N = U N 2 U N a . 

Now m € N 2 , since m < m; hence, n N 2 = 0. Similarly, m > m and so n N a = 0. 

Suppose N 2 n N a = {p> for some pGN. Then p < m and p > m or, what is the same, p < to 

and to < p. Since < is transitive, we have p < p, a contradiction. Thus, we must conclude 

that JV 2 n N a = 0 and the proof is now complete for this case. 

11. Prove: If m,n &N and m<n, then for each p € N, m + p<n + p and conversely. 

Since w < n, there exists some k £ N such that m + k = n. Then 

n + p = (m + k) + p = TO + fc + p = TO + p + fc = (m + p) + k 

and so m+ p < n + p 

For the converse, assume m + p < n + p. Now either to = n, m < n, or to > n. If m — n, then 
m + p = n + p; if m > n, then to + p > n + p (Theorem II'). Since these contradict the hypothesis, 
we conclude that m < n. 


12. Prove: The set N is well-ordered. 

Consider any subset of IV. We are to prove that S has a least element. This is certainly 

true if 1 G S. Suppose ICS; then 1 < s for every * G S. Denote by K the set 
K — {k: ke. N, k ^ s for each s G S) 

Since IGA, we know that K^0. Moreover K ¥= N; hence, there must exist anrGK such that 
r* € K. Now this r G S since, otherwise, r < s and so r* ^ s for every s G S. But then r* G K, a 
contradiction of our assumption concerning r. Thus, S has a least element. Now S was any non¬ 
empty subset of JV; hence, every non-empty subset of JV has a least element and JV is well-ordered. 
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Supplementary Problems 

13. Prove by induction: 1 • n = n for every « G N. 

14. Prove by induction: (a) M 4 , (6) M 2 , (c) M 3 . 

Hint: Use the result of Problem 13 and D 2 in (6). 

15. Prove: (a) D x by following Problem 5, (6) D 4 by using M 2 . 

16. Prove: (a) ( m + n*)* = m* + n* 

(b ) (m • n*)* = w n + m* 

(e) (m* • n*)* = m* + m-n + n* 
where m,n G N. 

17. Prove: (a) (m + n) • (p + q) — (m • p + m • q) + (m • p + n • q) 

(b) w(n + p)-q — (m • n) • g + m • (p • g) 

(c) m* + n* = (m + n)* + 1 

(d) m* -n* = (m-n)* + m + n 

18. Let m,n,p,q G N and define m - n - p • q = (m.' n* p) • q. (a) Show that in m • n • p • g we may 

insert parentheses at will. ( b) Prove: w(n + p + g) = m-n. + m*p + m*<j. 

19. Identify S = {* : * G N, n < x < n* for all « G N }. 

20. If m,n,p,q G N and if m<n and p < q, prove: (a) m + p < n + q, (b) m-p < n-q 

21. Let m,n&N. Prove: (a) If m = n, then k* *to > n for every fc G N. (b) If k* ’m = n for some 
k G N, then m < n. 

22. Prove A 4 and M 4 using the Trichotomy Law and Theorems II, Page 32, and II', Page 33. 

23. For all m&N define m 1 = m and = m? • m provided mP is defined. When m, n,p,qG. N, 

prove: (a) m" • = ro»+«, (6) = m» «, (c) (wn)»> = mP • n”, (d) (1)p = 1. 

24. For m,n&N show that (a) m 2 < m-n < n 2 if m < n, (6) m 2 + n 2 > 2m • n if m # «. 

25. Prove, by induction, for all n G N: 

(а) 1 + 2 + 3+ •••+» = £n(n+1) 

(б) I* + 2 2 + 3 2 + ■ • • + n 2 = \n(n + l)(2n + 1) 

(c) 13 + 23 + 3» + • • • + n* = + I) 2 

26. For a 4 , o 2 ,03, ..., a n G N define o 4 + 03 + o 3 + • • • + a fc = (a 4 + a 2 + o 3 + • • • + a^— j) + a & for 
k = 3,4,5,...,«. Prove: 

(а) a 4 + a 2 + a 3 + • • • + a n = (a 4 + a 2 + a a + • • • + a r ) + (a r+1 + a r+2 + o r+3 + •••+«„) 

(б) In any sum of n natural numbers, parentheses may be inserted at will. 

27. Prove each of the following alternate forms of the Induction Principle: 

(а) With each n G N let there be associated a proposition P(n). Then P(n) is true for every n G N 
provided: 

(i) P(l) is true. 

(ii) For each m G IV the assumption P(k) is true for all k < m implies P(m) is true. 

(б) Let b be some fixed natural number, and with each natural number n — ft let there be associated 
a proposition P(n). Then P(n) is true for all values of n provided: 

(i) P(6) is true. 

(ii) For each m> b the assumption that P(fc) is true for all A; G N such that b — k < m 
implies P(m) is true. 



Chapter 4 


The Integers 

INTRODUCTION 

The system of natural numbers has an obvious defect in that, given m,s G N, the 
equation m + x = s may or may not have a solution. For example, m + x = m has no 
solution (see Problem 7, Chapter 3) while m + x = m* has the solution x = 1. Everyone 
knows that this state of affairs is remedied by adjoining to the natural numbers (then 
called positive integers) the additional numbers zero and the negative integers to form 
the set / of all integers. 

In this chapter it will be shown how the system of integers can be constructed from 
the system of natural numbers. For this purpose, we form the product set 

L = NxN = sGN, mGN} 

Now we shall not say that (s, m) is a solution of m + x = 8. However, let it be perfectly 
clear, we shall proceed as if this were the case. Notice that if (s, m) were a solution of 
m + x - s, then (s, m) would also be a solution of m* + x - s* which, in turn, would 
have ( s*,m*) as solution. This observation motivates the partition of L into equivalence 
classes such that ( s,m ) and (s*,m*) are members of the same class. 

BINARY RELATION ~ 

Let the binary relation read “wave”, be defined on all (s, m), (t, n) G L by 
(s,m) ~ {t,n) if and only if s + n = t + m 

Example 1: (a) (5,2) ~ (9,6) since 5 + 6 = 9+2 

(6) (5,2) ^(8,4) since 5 + 4 *=8 + 2 

(c) (r, r) ~ (*, *) since r + 8 = 8 + r 

(d) (r*,r) ~ («*,*) since r* + 8 = 8* + r 

(e) (r*, 8*) ~ (r,s) since r* + » = r+s* 
whenever r,s G IV. 

Now ~ is an equivalence relation (see Problem 1) and thus partitions L into a set of 
equivalence classes $ = {[s, m], [t,n], ...} where 

[s,m\ = {(a, b) : (a, b) G L, (a,b) ~ (s,m)} 

We recall from Chapter 2 that (s,m) G [s, to] and that, if (c,d) G [s,m ], then [c,d\ = 
[s,m]. Thus [s,m] - [t,n] if and only if (s,m) ~ (t,n) 

It will be our purpose now to show that the set $ of equivalence classes of L relative 
to ~ is, except for the symbols used, the familiar set I of all integers. 
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ADDITION AND MULTIPLICATION ON g 

Addition and multiplication on g will be defined respectively by 

(i) [s, m\ + [t, n] = [(s + 1), ( m + »)] 

(ii) [s,m]*[t,n] = [{s-t + m-n), (s-n + m-t)] 
for all [s,m],[t,n] E g. 

An examination of the right members of (i) and (ii) shows that the Closure Laws 
Ai. x + y G g for all x,y G g 

and 

Mi. x-yEg for all x,y G g 

are valid. 

In Problem 3, we prove 

Theorem I. The equivalence class to which the sum (product) of two elements, one 
selected from each of two equivalence classes of g, belongs is independent 
of the particular elements selected. 

Example 2: If (a, b), (e,d) € [*,»t] and (e,f), (g, h) € [t,n], we have not only 

[o,6] = [c,d] = [s,ro] and [e,f\ = [g,h\ = [t,n] 
but, by Theorem I, also 

[a,b] + [e,f] = [e,d\ + [g,h] = [B,m\ + [t,n] 
and [«,6] • [e,f] = [c,d] • [g,h] = [s,m] • [t,n] 

Theorem I may also be stated as follows: Addition and multiplication on g are 
well-defined. 

By using the commutative and associate laws for addition and multiplication on N, it 
is not difficult to show that addition and multiplication on g obey these same laws. The 
associative law for addition and one. of the distributive laws are proved in Problems 4 and 5. 


THE POSITIVE INTEGERS 

Let r EN. From 1+r = r* it follows that r is a solution of 1 +x = r*. Consider 
now the mapping 

[»M]«*«, nEN ( 1) 

For this mapping, we find 

[r*,l] + [s\l] = [(r* + s*), (1 +1)] = [(r + s)M] r + s 

and 

[rM].[«M] = [(r*• s* + 1 • 1), (r*• 1 + s*• 1)] = [(r-a)*,l] ** r-s 

Thus, ( 1 ) is an isomorphism of the subset {[m*,1] :nEN} of g onto N. 

Suppose now that [s,m) = [r*, 1], Then (s, m) ~ (r*, 1), s = r + m, and s>m. This 
suggests defining the set 7 + of positive integers by 

I + — {[s,m]: [s,m] G g, s > m} 


In view of the isomorphism ( 1) the set / + may be replaced by the set N whenever the 
latter is found more convenient. 
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ZERO AND THE NEGATIVE INTEGERS 

Let r,s G N. Now [r, r] = [s, s] for any choice of r and s, and [r, r ] = [s, t] if and 
only if t = s. We now define the integer zero, 0, to correspond to the equivalence class 
[r,r], r G N. Its familiar properties are 

[s,m\ + [r,r] = [s,m] and [s, m\ • [r, r] = [r,r\ 
proved in Problems 2(6) and 2(c). The first of these leads to the designation of zero as 
the identity element for addition. 

Finally, we define the set J~ of negative integers by 


/- = {[s,m]\ [s,m\Gg,s<m) 


It follows now that for each integer [a, 6], 
that (see Problem 2(d)) 

V [a, 6] + [6, 


a ^ b, there exists a unique integer [b, a] such 
a] = [r,r] 0 (2) 


We denote [6, a] by - [a, 6] and call it the negative of [a, 6]. The relation ( 2 ) suggests the 
designation of [6, a] or ~[a,b] as the additive inverse of [a, 6]. 


THE INTEGERS 

Let p, q G N. By the Trichotomy Law for natural numbers, there are three possibilities: 

(а) p = q, whence [p, q] = [q, p] ** 0. 

(б) p < q, so that p + a = q for some aGN; then p + a* = q +1 and [q,p] = [a*,l] ** a- 
(c) p > q, so that p = q + a for some aGN and [p, q] a. 

Suppose [p,q] n GN. Since [q,p] = ~[p,q], we introduce the symbol -n to 
denote the negative of n G N and write [q,p] ~ -». Thus, each equivalence class of g 
is now mapped onto a unique element of / = (0, ±1, ±2, ...}. That g and 1 are isomorphic 
follows readily once the familiar properties of the minus sign have been established. In 
proving most of the basic properties of integers, however, we shall find it expedient to 
use the corresponding equivalence classes of g. 

Example 3: Let a, b S /. Show that (-a) 'b = -{a • 6). Let a ** [s, m] so that -a ** [m, s] and 
let 6 [t,n]. Then 

(-o) • 6 <-* [m, s] • [t, m] = [(wi • t + * • n), (m-n + 8-1)} 
while o • b [*, «i] • [t, n] = [(a • t + w n), (8 • n + m • t)] 


Now ~(a ’b) ** [(«• n + m • t), (s • t + m • w)] [(-o) • 6] 


and so 


(—a) • 6 = —(o • b) 


See Problems 6-7. 


ORDER RELATIONS 

For a,b G I, let a [s,m] and 6 <h> [t,n]. The order relations “<” and “>” for 
integers are defined by 

a < b if and only if (s + «) < (t + m) 
and a > b if and only if ( s + n)>(t + m) 

In Problem 8, we prove the Trichotomy Law. For any a, b G I, one and only one of 
(a) a — b, (b) a < b, ( c) a > 6 


is true. 
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When a,b,c G I, we have: 

( 1 ) a + c < b + c if and only if a < b. 

( 1 ') a + c > b + c if and only if a > b. 

(2) If c > 0, then a- c <b' c if and only if a <b. 

(2') If c > 0, then a-Ob'C if and only if a> b. 

(3) If c < 0, then a • c < b • c if and only if a> b. 

(3') If c < 0, then a- c>b- c if and only if a <b. 

For proofs of (1') and (3), see Problems 9-10. 
The Cancellation Law for multiplication on I, 

M 4 . If z ¥= 0 and if X’Z = yz, then x — y 

may now be established. 

As an immediate consequence, we have 
Theorem II. If a,b G 1 and if a-b = 0, then either a = 0 or b = 0. 

For a proof see Problem 11. 

The order relations permit the customary listing of the integers 
..., -5, -4, -3, -2, -1, 0, 1, 2, 3, 4, 5 , ... 
and their representation as equally spaced points on a line. Then “a < b” means “a lies 
to the left of b”, and " a > b” means “a lies to the right of b”. 



Fig. 4-1 

From the above listing of the integers, we note 
Theorem III. There exists no n G I + such that 0 < n < 1. 

This theorem (see Problem 12 for a proof) is a consequence of the fact that the set 7 + of 
positive integers (being isomorphic to N) is well-ordered. 

SUBTRACTION 

Subtraction, on I is defined by a — b — a + (—b). Subtraction is clearly a binary 
operation on /. It is, however, neither commutative nor associative although multiplication 
is distributive with respect to subtraction. 

Example 4: Prove: a-(b-c) ¥= ( a-b)-c for a,b,c G I and e ¥= 0. 

Let a <-» [ s,m ], b [t,n], and c [u,p\. Then 

b — c = 6 + (—c) [(t + p), (n + w)] 

—(b — c) [(n + it), (t + p)] 

and a ~ (b — e) = a+ (-(b - c)) <-> [(a + n + it), (m + t + p)] 

while a — b — a+ (—6) [(a + n), (to + t)] 

and (a — 6) — c = (o + b) + (—c) [(* + n + p), (to + t + u)\ 

Thus, when e ¥= 0, a — (b — e) ¥= (a —b) — c. 
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ABSOLUTE VALUE \a\ 

The absolute value, "|o|”, of an integer a is defined by 


M 

Thus, except when a - 0, |a| € /+. 
The following laws 


a when a — 0 
-a when a < 0 


(1) —1«| - a ^ |a| 

(3) |a| — |6| — |a + 6| 

(4) |a| — |&] — |a — 6| 


(2) |a-b| = |a| • |6| 

(3') |a + 6| ^ |a| + |6| 
(4') |a — b| — |a| + |6j 


are evidently true when at least one of a,b is 0. They may be established for all a,b G I 
by considering the separate cases as in Problems 14 and 15. 


ADDITION AND MULTIPLICATION ON I 

The operations of addition and multiplication on I satisfy the laws A1-A4, M1-M4, and 
D1-D2 of Chapter 3 (when stated for integers) with the single modification 

M 4 . Cancellation Law: If wp = n‘p and if p^OGJ, then m — n for all 
m,n € /. 

We list below two properties of I which N lacked 

As. There exists an identity element, 0 6/, relative to addition, such that n + 0 = 
0 + « = n for every n G /. 

A s . For each n G / there exists an additive inverse, —« €= I, such that n + (— n) = 
(-») + n = 0. 

and a common property of N and I 

M 5 . There exists an identity element, 1 € /, relative to multiplication, such that 
1 • n = w • 1 = w for every n G I. 

By Theorem III, Chapter 2, the identity elements in As and Ms are unique; by Theorem 
IV, Chapter 2, the additive inverses in As are unique. 


OTHER PROPERTIES OF INTEGERS 

Certain properties of the integers have been established using the equivalence classes 
of $. However, once the basic laws have been established, all other properties may be 
obtained using the elements of I themselves. 

Example 5: Prove: For all o,6,cG/, 

(o) a • 0 = 0 • a — 0 (6) a(—b) = -(06) (c) o(& — c) = ab — ac 

(a) a + 0 = a (A^) 

Then a*a + 0 = a»a = a(a + 0) = o*a + a*0 (Dj) 

and 0 = a*0 (A4) 

Now, by M 2 , 0 • o = a • 0 = 0, as required. However, for reasons which 
will not be apparent until a later chapter, we will prove 
0 *o = o*0 

without appealing to the commutative law of multiplication. We have 
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a • a + 0 = a-a = (a + 0)a = a • a + 0 • a (D 2 ) 

hence, 0 = 0*a 

and 0*o = o*0 


(6) 0 = a • 0 = a[b + (-6)] = a-b + a(-b) (Dj) 

thus, a(—b) is an additive inverse of a • b. But —(a • 6) is also an additive in¬ 
verse of a • 6; hence, 

a(-b) - -(a-b) (Theorem IV, Chapter 2) 


(e) a(b —e) = a[6 + (—c)] = ab + a(—e) (D t ) 

= ab 4- (—ae) ((h) above) 

= ab — ac 

Note: In (c) we have replaced a • b and —(a - e) by the familiar ab and —oc 
respectively. 


MULTIPLES AND POWERS 

The section with this title in Chapter 3 (Page 33) may be repeated here after replacing 
N by 7 + . t 

It is to be noted that for the case S = I, we may now identify ka with k-a. Moreover, 
since I contains the additive inverse of each of its elements, we have (ix)-(xi), but not 
(ix)'-(xi)', for all m,n G /. This is largely trivial, of course, since when a,b,m,nGl 
the properties (ix) and (xi) are then the distributive laws. 


Solved Problems 

1. Show that ~ on L is an equivalence relation. 

Let (s, m), (t, n), (u, p) G L. We have 

(а) (s, m) ~ (s, m) since « + ♦» = « + »»; ~ is reflexive. 

(б) If (s,m) ~ (t, n), then ( t,n) — («, m) since each requires s + n = t + m; ~ is symmetric. 

(c) If (s, m) ~ (t, n) and (t, n) ~ (u, p), then s +» = t + m, t + p = u + n, and s + w+ t + p = 

t + m + u + n. Using A 4 of Chapter 3, Page 30, the latter equality can be replaced by 

8 + p — m + u; then (s, m) ~ («, p) and ~ is transitive. 

Thus, ~, being reflexive, symmetric, and transitive, is an equivalence relation. 

2. When s,m,p,r G N, prove: 

(а) [(r -I- p), p] = [r*, 1] (c) [s, m\ - [r, r\ = [r, r] (e) [s, m] - [ r*, r] - [s, m] 

(б) [s,m] + [r,r] = [s,m] (d) [s,m] + [m,s] = [r,r] 

(а) ((r + p), p) ~ (r*, 1) since r + p + 1 =r* + p. 

Hence, [(r + p), p] = [r*, 1] as required. 

(б) [s, m] + [r, r] = [(* + r), (m + r)]. Now ((* + r), (m + r)) ~ (s, m) since (s + r) + m — s + (m +r). 
Hence, [(s + r), (m + r)] = [*,-m] + [r,r] = [«,«]. 

(c) [*, m] - [r,r] = [(g • r + m • r), (s • r + m • r)] = [r, r] since s’r + m*r + r = g-r + ?n*r + r. 

(d) [g,m] + [m,g] = [(g + m),(s + m)] = [r,r] 

(e) [8,m] - [r*,r] = [(g«r* + m-r), (s-r + m•»•*)] = [g,m] since 8-r* + m-r + m = 8 + 8*r + 

m*r* = + r*. 
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3. Prove: The equivalence class to which the sum (product) of two elements, one selected 
from each of two equivalence classes of g, belongs is independent of the elements 
selected. 

Let [a, 6] = [8, to] and [c,d] = [t.n]. Then (a,b) ~ (s,m) and (c,d) ~ (t,n) so that 

a + to = * + 6 and e + n = t + d. We shall prove: 

(а) [a, 6] + [e, d] = [s,m] + [t,w], the first part of the theorem; 

(б) a'C+b'd + 8'n + m.'t = a*d+6*c + 8*t + TO*», a necessary lemma; 

(c) [a, 6] • [e, d] = [s, to] • [t.n], the second part of the theorem. 

(a) Since a+m+c+n= s + 6 + t+d, 

(a+c)+ (»» + «) = (s+f) + (6 + d) 

((a + e), (6 + d)) ~ (<8 + t), (to + «)) 

[(a + e), (6 + d)] = [(s + t), (to + «)] 
and [«,&) +[c,d] = [s,to] + [t.n] 

(b) We begin with the evident equality 

(o + to) • (c +t) + (s + 6) • (d + n) + (c + ») • (a + 8) + (d+t) • (6 + to) 

= (8 + 6) • (e + t) + (a + to) • (d + n) + (d+t) • (a+ 8) + (« + »)• (6 + m) 

which reduces readily to 

2(o*c + 6*d + 8*n + TO*t) + (o*t + wc + 8*d+6*n) + (s*e + wa+6*t + TO*d) 

= 2(a*d + 6*c + s*t + TO*n) + (o*t + TO-c + 8-d+6*n) + (s • e + «• a + b • t + to- d) 

and, using the Cancellation Laws of Chapter 3, to the required identity. 

(e) From (6), we have 

(o • c + 6 • d) + (s • n + to ♦ t) = (s • t + w • w) + (o • d + 6 • e) 

Then ((a• c + 6 • d), (a• d + 6 • c)) ~ ((8 ’t + to* »), (8 •« + w t)) 

[(a • c + b • d), (a • d + 6 • c)] = [(8 • t + to •«), (8 • n + w • t)] 
and so [«,&] ’ [«. d] = [s,to] • [t.n] 

4. Prove the Associative Law for addition: 

([s,m] + {t, n]) + [u, p] = [s,m] + ([£,»] + K P}) 
for all [s, m], [t, n ], [u, p] G g. 

We find 

([s,to] + [t,n]) + [w,p] = [(s + t), (to + n)\ + [n, p] = [(s + t + M), (m + n + p)} 

while [8 ,to] + ([f,w] + [«,p]) = [e, to] + [(t + m), (n + p)] = [(s + t + u), (m + n + p)] 

and the law follows. 


5. Prove the Distributive Law D 2 : 

([», m] + [t,n}) • [u,p) = [s,m\ • [u,p] + [t.n] • [u.p] 

for all [s,m\,[t,n],\u,p] G g. 


We have 

([8 , to] + [ t.n ]) • [u, p] 


[(s + t), (TO + n)] • [u, p] 

[((s + t)*n + (to + n) • p), ((s + t)*p + (to + n) • n) ] 

[(s• u + t • u + w• p + w• p), (s’p + t'p + m-u + n'v)} 

[((8*n + TO-p) + (t • m + n • p)), ((s ■ p + to • u) + (t • p + n • n)) ] 
[(s • u + to • p), (s • p + to • n)] + [(t • u + n • p), (t • p + n • n)] 

[s, to] • [n,p] + [t.n] • [u.p] 
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6. (a) Show that a 4- (—a) = 0 for every a GI. 

Let a then — a [m, 8], 

a + (—a) <-» [g,m] + [m,g] = [(* + m), (m + *)] = [r,r] ** 0 
and a + (—a) = 0. 

(b) If x + a = b for a,b G I, show that x = 6 +(-a). 

When * = b + (-a), x + a = (b + (-a)) + a = b + ((-a) + a) = b; thus, x = b + (-a) is a 
solution of the equation x + a = b. Suppose there is a second solution y. Then y + a = b = 
x + a and, by A4, y = x. Thus the solution is unique. 

7. When a.bGl, prove: (1) (-a) + (-6) = -(« + &), (2) (-a)• (-5) = a- b. 

Let a«-»[s,m] and 6 [t,w]; then —a [m, *] and —6 <-»[«, t]. 

(1) (—a) + (—6) ++ [m,«] + [«,<] = [(*» + *), (•+*)] 

and a + 6 ** [*,m] + [t,n] = [(* + 1), (m +»)] 

Then —(o +6) *+ [(w +«), (* + t)] (—«) + (-6) 

and (—0) + (—6) = —{»+ 6) 

(2) (- 0 ) • (-6) [m,*]•[«,t] = [(m •« + «•*), («t • t + s • w)] 

and a • 6 ++ [*, m] • [t, w] = [(* • t + m • n), (s • n + m • t)] 

Now [(m •» + «•<),(»»•< + «•«)] = [(«• t + to • w), (8 • n + m • t)] 

and (— 0 ) • (~b) = o • 6 

8. Prove the Trichotomy Law: For any a,b GI one and only one of 

(a) a= b, (b) a < b, (c) a>b 

is true. 

Let a ** [g, m] and b ^ [*,«]; then by the Trichotomy Law of Chapter 3 , Page 32 , one and 
only one of (a) 8 + n = t + m and a = b, (b) 8 + n < t + m and a < b, (c) 8 + n > t + m and 
a > b is true. 

9. When a,b,c G I, prove: a + c>6 + c if and only if a > b. 

Take a [s,m], 6 [t,n], and e ++ [u,p]. Suppose first that 

o + c > b + c or ([g,m] + [«,p]) > ([t,n] + [u,p]) 

Now this implies [(s + «), (m + p)] > [(t + u), (n + p)] 

which, in turn, implies (g + m) + (w + p) > (t + u) + (m + p) 

Then, by Theorem II', Chapter 3 , Page 33 , (g + n) > (t + m) or [g, w] > [t, n] and o > 6, as required. 
Suppose next that a > b or [a, m] > [t, *]; then (g + n) > (t + m). Now to compare 
o + c <-> [(g + m), (m + p)] and b + c [(t + it), (n + p)] 


compare 


[(g + it), (m + p)] and [(t + it), (« + p)] 

(g + it) + (n + p) and (t + it) + (m + p) 

(g + n) + (it + p) and (t + m) + (u + p) 
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Since (s + n) > (t + to), it follows by Theorem II', Chapter 3, Page 33, that 
(* + n) + (M + p) > (t + m) + (u + p) 

Then (« + w) + (n + p) > (t + u) + (m + p) 

[(* + it), (m + p)] > [(i + m), (n + p)] 

and a + e > b + c 

as required. 

10. When a,b,cGl, prove: If c < 0, then a-c < b'C if and only if a > b. 

Take a <-»[*, to], b [t,w], and c +* [it,p] in which u < p since e < 0. 

(o) Suppose a *c < b • e; then 

[(«• m + to • p), (s • p + to • it)] < [(«• u + »• p), (t • p + n ■ u)] 

and (* • m + to • p) + (t • p + «• u) < (t • tt + n • p) + (s • p + to • tt) 

Since u < p, there exists some k£ N such that u + k = p. When this replacement for p is 
made in the inequality immediately above, we have 

(8'u. + m.‘u + m‘k + t , u + t , k + n'u) < (t'u + n’u + n’k + s'u + s'k + m’u) 
whence m*k + t*fc < «*fc + s*k 

Then (to + t) • k < (n + ») • k 

to + t < n + s 
8 + n > t + m 

and a > b 

( b) Suppose a > b. By simply reversing the steps in (a), we obtain a *e < b • c, as required. 

11. Prove: If a,b G I and a* b = 0, then either a = 0 or 6 = 0. 

Suppose ii/O; then a • 6 = 0 = 0 and by M 4i 6=0. Similarly, if 6 # 0 then a = 0. 

12. Prove: There exists no nG /+ such that 0 < n < 1. 

Suppose the contrary and let m G /+ be the least such integer. From 0 < to < 1, we have by 

the use of (2), Page 41, 0 < to 2 < to < 1. Now 0 < to 2 < 1 and to 2 < w contradict the assumption 

that w is the least, and the theorem is established. 

13. Prove: When a,b G I, then a < b if and only if a- b < 0. 

Let a [*, to] and 6 <-> [t ,«]. Then 

a-b = a + (—6) [a,w] + [n,t] = [(s + «), (to + t)] 

If a < b, then s + n < m+t and a-b < 0. Conversely, if a — b < 0, then s + n < m + t and 
a < b. 

14. Prove: |a + 6| — |a| + [6| for all a,bGl. 

Suppose a > 0 and b > 0; then |o + 6| = a + b = |o| + |6|. 

Suppose a<0 and 6 < 0; then |o + 6| = —(a + b ) = -a + (—6) = |o| + |6|. 
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Suppose a > 0 and 6 < 0 so that |a| = a and | 6 | = — 6 . Now, either a + b - 0 and |a + b\ - 
o < M + | 6 | or 

a + b < 0 and |a+ 6 | = -(a + 6 ) = -a + (-ft) = — 1 »| + | 6 | < M + |»| 

or a + 6 > 0 and |a + 6 | = a + b = a — (— 6 ) = |a| — | 6 | < |a| + |b| 

The case a < 0 and & > 0 is left as an exercise. 

15. Prove: |a-6| = |o|*|6| for all a,bGl. 

Suppose a > 0 and b > 0; then |o| = a and |b| = b. Then \a • ft] = a • b = |o| • |b|. 

Suppose a < 0 and b < 0; then |o| = —a and | 6 | = —b. Now a • b > 0; hence, |a • 6 | = o • ft = 

(-a)-(-ft) = \a\-\b\. 

Suppose a > 0 and 6 < 0; then |a| = a and | 6 | = —ft. Since a • ft < 0, |o • 6 | = — (a • 6 ) = 

• •(- 6 ) = l«Hft|. 

The case a < 0 and ft > 0 is left as an exercise. 

16. Prove: If a and b are integers such that a • b = 1 then a and b are either both 1 or 
both —1. 

First we note that neither a nor ft can be zero. Now |a*ft| = |a| • | 6 | = 1 and, by Problem 12, 

|a| S: 1 and |ft| ^ 1. If \a\ > 1 (also, if |ft| > 1), |o| • |ft| * 1. Hence |a| = |ft| = 1 and, in view of 

Problem 7(6), the theorem follows. 


Supplementary Problems 

17. Prove: When r,sG N, 

(а) (r,r) ~ (a, a) ~ (1,1) (d) (r*,r) + (r,r*) 

( б ) (r*,r) ~ («*,«) ~ ( 2 , 1 ) (e) (r*,r) + (s.s*) 

(c) (r, r*) ~ («,«•) ~ ( 1 , 2 ) ( f) (r* • s* + 1 , r* + «*) ~ ((r • «)*, l) 

18. State and prove: (a) the Associative Law for multiplication, ( 6 ) the Commutative Law for addition, 
(c) the Commutative Law for multiplication, (d) the Cancellation Law for addition on (j. 

19. Prove: [r*,r] 1 and \r,r*] <-» —1 

20. If a G /, prove: (a) a • 0 = 0 • a = 0, ( 6 ) (-1) • a = -a, (c) -0 = 0. 

21. If a, 6 G/, prove: (a) — (— a) = +a, ( 6 ) (—o)(— 6 ) = o • 6 , (c) (—a) + 6 = — (a + (— 6 )). 

22. When 6 G / + , show that a-b < a + b for all a G /. 

23. When a, ft G 1 , prove (1), (2), (2') and (3'), Page 41, of the order relations. 

24. When a,b,cGl, prove a* (6 — c) = o *6 —a*c. 
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25. Prove: If a,b G I and a < b, then there exists some cGl + such that a+e = b. 

Hint. For a and b as represented in Problem 7, take c [(t + m), (m+s)]. 

26. Prove: When a,b,c,d G /, 

(a) —a > —6 if a < b. 

(b) a + c < b + d if a < b and e < d. 

(c) If a < (b + c), then a — b<e. 

(d) a— b = c — d if and only if a + d = b + c. 

27. Prove that the order relations are well-defined. 

28. Prove the Cancellation Law for multiplication. 

29. Define sums and products of « > 2 elements of / and show that in such sums and products parentheses 

may be inserted at will. 

30. Prove: (a) m 2 > 0 for all integers m # 0. 

(5) m 3 > 0 for all integers m > 0. 

(c) m 3 < 0 for all integers m < 0. 

31. Prove without using equivalence classes (see Example 5): 

(a) —(—a) = a 

(b) (—a)(—b) = ab 

(e) (b-c) = (b + a) — (e + a) 

(d) a(b — e) — ab — ac 

(e) ( a + b)(c + d) = (ac + ad) + (be + bd) 

(f) (a + b)(c-d) = (ac+bc) - (ad+bd) 

(g) (a — b)(e — d) — (ac + bd) — (ad + be) 
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Some Properties of Integers 

DIVISORS 

An integer a ¥> 0 is called a divisor (factor) of an integer 6 (written “a | b") if there 
exists an integer c such that b = ac. When a \ b we shall also say that b is an integral 
multiple of a. 

Example 1: (a) 2 | 6 since 6 = 2*3 

(6) -3 116 since 16 = (—3)(—5) 

(c) a | 0, for all a & I, since 0 = a • 0 

In order to show that the restriction a ^ 0 is necessary, suppose 01 b. If b ¥■ 0, we 
must have b = 0 • c for some cGl, which is impossible; while if 6 = 0, we would have 
0 = 0 • c, which is true for every c G /. 

When b,c\x,y e /, the integer bx + cy is called a linear combination of 6 and c. In 
Problem 1, we prove 

Theorem I. If a I 6 and a I c then a I (bx + cy) for all x,y G I. 

See also Problems 2, 3. 

PRIMES 

Since a • 1 = (-a)(-l) = a for every aGl, it follows that ±1 and ±a are divisors 
of a. An integer p # 0, ±1 is called a prime if and only if its only divisors are ±1 and ±p. 

Example 2: (o) The integers 2 and —6 are primes, while 6 = 2*3 and —39 = 3(—13) are not 

primes. 

(6) The first 10 positive primes are 2,3,6,7,11,13,17,19,23,29. 

It is clear that — p is a prime if and only if p is a prime. Hereafter, we shall restrict 
our attention mainly to positive primes. In Problem 4, we prove: 

The number of positive primes is infinite. 

When a = be with |6| > 1 and |c| > 1, we call a composite. Thus every integer 
a ¥* 0, ±1 is either a prime or a composite. 

GREATEST COMMON DIVISOR 

When a \ b and a \ c, we call a a common divisor of 6 and c. When, in addition, every 
common divisor of 6 and c is also a divisor of a, we call a a greatest common divisor 
(highest common factor) of 6 and c. 

Example 3: (a) ±1, ±2, ±3, ±4, ±6, ±12 are common divisors of 24 and 60. 

(6) ±12 are greatest common divisors of 24 and 60. 

(c) The greatest common divisors of 6 = 0 and c ¥= 0 are ±c. 
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Suppose c and d are two different greatest common divisors of a ¥= 0 and 6^0. Then 
c | d and d \ c; hence, by Problem 3, c and d differ only in sign. As a matter of convenience, 
we shall hereafter limit our attention to the positive greatest common divisor of two 
integers a and 6 and use either d or (a, b) to denote it. Thus, d is truly the largest (greatest) 
integer which divides both a and b. 

Example 4: A familiar procedure for finding (210,510) consists in expressing each integer as a 
product of its prime factors, i.e., 210 = 2 • 3 • 5 • 7, 510 = 2 • 3 • 5 • 17, and forming 
the product 2 • 3 • 5 = 30 of their common factors. 

In Example 4 we have tacitly assumed (a) that every two non-zero integers have a 
positive greatest common divisor and ( b) that any integer a > 1 has a unique factoriza¬ 
tion, except for the order of the factors, as a product of positive primes. Of course, in 
(6) it must be understood that when a is itself a prime, “a product of positive primes” 
consists of a single prime. We shall prove these propositions later. At the moment, we 
wish to exhibit another procedure for finding the greatest common divisor of two non-zero 
integers. We begin with: 

The Division Algorithm. For any two non-zero integers a and b, there exist unique 
integers q and r, called respectively quotient and remainder, such that 

a = bq + r, 0 ^ r < |6| ( 1) 

For a proof, see Problem 5. 

Example 5: (a) 780 = -48(-16) + 12 (c) 826 = 25*33 + 1 

(6) -2805 = 119(—24) + 51 (d) 768 = 242(3) + 32 

From ( 1) it follows that 6|a and (a,b) = b if and only if r = 0. When r# 0 it is 
easy to show that a common divisor of a and b also divides r and a common divisor of 
b and r also divides a. Then (a,6)|(6,r) and (6,r)|(o,6) so that, by Problem 3, 
(a,b) = (b,r). Now either r\b (see Examples 5(a) and (5c)) or r/6 (see Examples 5(6) and 
5(d)). In the latter case, we use the division algorithm to obtain 

6 = rqi + n, 0 < ri < r {2) 

Again, either n \ r and (a, 6) = ri or, using the division algorithm, 

r = n< 7 2 + r 2 , 0 < r 2 < n (3) 

and (a, 6) = (6, r) = (r, n) = (r,, r 2 ). 

Since the remainders n, r 2 ,..., assuming the process to continue, constitute a set of 
decreasing non-negative integers there must eventually be one which is zero. Suppose 
the process terminates with 

(k) r k - s = ric- 2 *q fc -i + r k -\ 0 < r k -i < r fc - 2 

(k +1) r k - 2 = r k -i’ q k + r k 0<r k <r k -i 

(k + 2) r k -i - r k ’q k +1 + 0 

Then (a, 6) = (6,r) = (r,r,) = ••• = (r k - 2 ,r fc -i) = (r fc -,,r fc ) = r k . 

Example 6: (a) In Example 5(6), 51/ 119. Proceeding as in (S), we find 119 = 51(2) + 17. Now 

17 | 61; hence, (-2805,119) = 17. 

(6) In Example 5(d), 32/242. From the sequence 

768 = 242(3) + 32 

242 = 32(7) + 18 

32 = 18(1) + 14 

18 = 14(1) + 4 

14 = 4(3) + 2 
4 = 2(2) 

we conclude (758, 242) = 2. 
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Now solving ( 1) for r = a - bq = a + (—q)b = Wia + nib ; and 
substituting in ( 2), n — b — rq t = b — (mia + nib)q t 

-- -m,q t a + (1 — n t qi)b -- m 2 a + n 2 b 
substituting in (8), r 2 = r — riq 2 = (nua + nib) — (m 2 a + n-ib)q% 

— {mi — q 2 m^)a + («i — qtfi^b = m 3 a + n 3 b 


and continuing, we obtain finally 
Thus, we have 


m k + ia + nic+ib 


Theorem II. When d = (a, 6), there exist m,n G I such that d = (a, b) = ma + nb. 


Example 7: Find (726, 276) and express it 
From 

726 = 275-2 + 176 

275 = 176*1 + 99 

176 = 99*1 + 77 

99 = 77*1 + 22 

77 = 22*3+11 

22 = 11 • 2 

Thus, m = 11 and n = -29. 


the form of Theorem II. 

We obtain 

11 = 77 - 22 • 3 = 77 - (99 - 77) • 3 
= 77-4-99*3 
= (176 - 99) • 4 - 99 • 3 
= 176*4 - 99*7 
= 176 • 4 - (275 - 176) • 7 
= 176 • 11 - 275 • 7 
= (726 - 275 • 2) • 11 - 275 • 7 
= 11 • 726 + (-29) • 275 


Note 1. The procedure for obtaining m and « here is an alternate to that used in 
obtaining Theorem II. 

Note 2. In (a, 6) = ma + nb, the integers m and n are not unique; in fact, (a, 6) = 
(m + kb)a + (« - ka)b for every k G N. 

See Problem 6. 

The importance of Theorem II is indicated in 

Example 8: Prove: If a \ e, if b \ c, and if (o, 6) = d, then ab | cd. 

Since a \ c and 6 | e, there exist integers s and t such that e = as = bt. By 
Theorem II there exist m,n e I such that d = ma + nb. Then 
cd = cma + cnb = btma + asnb = ab(tm + sn) 

and ab \ cd. 

A second consequence of the Division Algorithm is 

Theorem III. Any non-empty set K of integers which is closed under the binary opera¬ 
tions addition and subtraction is either {0} or consists of all multiples of 
its least positive element. 


An outline of the proof when K + {0} follows. Suppose K contains the integer a 0. 
Since K is closed with respect to addition and subtraction, we have: 

(i) a - a = 0 € K 

(ii) 0 — a — —a G K 

(iii) K contains at least one positive integer. 

(iv) K contains a least positive integer, say, e. 

(v) By induction on n, K contains all positive multiples ne of e (show this). 

(vi) K contains all integral multiples me of e. 

(vii) If b G K, then b = q • e + r where 0 — r < e; hence, r = 0 and so every element 
of K is an integral multiple of e. 
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RELATIVELY PRIME INTEGERS 

Forgiven a,bGl, suppose there exist m,n G I such that am + bn = 1. Now every 
common factor of a and b is a factor of the right member 1; hence, (a, b) = 1. Two 
integers a and b for which (a, 6) = 1 are said to be relatively prime. See Problem 7 

In Problem 8, we prove 

Theorem IV. If (a,s) = (b,s) = 1, then (ab,s) = 1. 


PRIME FACTORS 

In Problem 9, we prove 

Theorem V. If p is a prime and if p \ ab, where a, b G /, then p | a or p\b. 

By repeated use of Theorem V there follows 

Theorem V'. If p is a prime and if p is a divisor of the product a • b • c • ... • t of n 
integers, then p is a divisor of at least one of these integers. 

In Problem 10, we prove 

The Unique Factorization Theorem. Every integer a > 1 has a unique factorization, 
except for order, 

(a) a = p» • p* • Ps • ... • P» 

into a product of positive primes. 

Evidently, if (a) gives the factorization of a, then 

-o = -(pi • pi • pa • ... * P«) 

Moreover, since the p’s of (a) are not necessarily distinct, we may write 
a = p“< • p“* • p“» • ... • p“* 

where each a t ^ 1 and the primes Pi, P 2 , Pa,..., p« are distinct. 

Example 9: Express each of 2,241,766 and 8,666,074 as a product of positive primes and obtain 
their greatest common divisor. 

2,241,756 = 22 • 3 4 • 11 • 17 • 37 and 8,566,074 = 2 • * • 11* • 1# • 28 
Their greatest common divisor is 2 • 3 4 • 11. 


CONGRUENCES 

Let m be a positive integer. The relation “congruent modulo m”, (= (mod m)), is 
defined on all a, b G I by a = b(mod m) if and only if m | (a - b). 


Example 10: (a) 89 = 25(mod 4) since 4 | (89 —25) = 64 

(6) 89 = l(mod 4) since 4 | 88 


(e) 24 # 3(mod 5) since 5/21 
(/) 243 & 167(mod 7) since 7/76 


(c) 25 = l(mod 4) since 4 | 24 

(d) 153 = —7(mod 8) since 8 1160 


(g) Any integer a is congruent 
modulo to to the remainder 
obtained by dividing a by to. 


An alternate definition, often more useful than the original, is a = b(mod m) if and 
only if a and b have the same remainder when divided by m. 

As immediate consequences of these definitions, we have: 

Theorem VI. If a = b(mod m) then, for any n G /, mn + a = b(mod m) and conversely. 
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Theorem VII. If ft = 6(mod m), then, for all x G 7, a + x = b + x(mod m) and ax = 
6x(mod m). 


Theorem VIII. 
Theorem IX. 


If ft = 6(mod m) and c -- e(mod m), then a + c = b + e(mod m), a — c = 
b — e(mod m), ac * be(modm). gee Problem n 

Let ( c,m ) = cL and write m = mid. If ca - c6(mod m), then a = &(modmi) 
and conversely. 

For a proof, see Problem 12. 


As a special case of Theorem IX, we have 

Theorem X. Let (c, m) = 1 . If ca = c6(modw), then a = 6(mod m) and conversely. 

The relation = (mod m) on 7 is an equivalence relation and separates the integers into 
m equivalence classes, [0], [1], [2],..., [m —1], called residue classes modulo m, where 
[r] = {a: a e /, ft s r(modw)} 


Example 11: The residue classes modulo 4 are: 

[0] = {..., -16, -12, —8,,-4, 0, 4, 8, 12, 16, ...} 

[1] = {...,-16,-11,-7,-3,1,6,9,13,17,...} 

[2] = {..., -14, -10, -6, -2, 2, 6,10, 14, 18, ...} 

[3] = {..., -13, -9, -5, -1, 3, 7,11, 16, 19, ...} 


We will denote the set of all residue classes modulo m by I/(m). For example, 7/(4) = 
{[°]» [!]. [2]. [3]} and I/(m) = {[0], [ 1 ], [2], [3], ..., [m- 1 ]}. Of course, [3] € 7/(4) = 
[3] € Him) if and only if m = 4. Two basic properties of the residue classes modulo m are: 

If a and b are elements of the same residue class [s], then ft = 6(mod m). 

If [s] and [<] are distinct residue classes with a G [s] and b G [t], then a # 6(mod m). 


THE ALGEBRA OF RESIDUE CLASSES 

Let “©” (addition) and “O” (multiplication) be defined on the elements of 7/(m) as 
follows: 

[ft] ©[6] = [ft + b] 

[a] O [6] = [ft-6] 

for every [a], [6] G I/(m). 

Since © and Q on I/(m) are defined respectively in terms of + and • on 7, it follows 
readily that © and O satisfy the laws At-A 4 , and Di-D 2 as modified in Chapter 4, 

Page 42. 


Example 12: The addition and multiplication tables for 7/(4) are: 


©0123 
0 0 12 3 

112 3 0 

2 2 3 0 1 

3 3 0 1 2 


and 


Q 0 1 2 3 
0 0 0 0 0 
10 12 3 
2 0 2 0 2 
3 0 3 2 1 


Table 5-1 


Table 5-2 


where, for convenience, [0], [1], [2], [3] have been replaced by 0,1,2,3. 
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LINEAR CONGRUENCES 

Consider the linear congruence 
(6) ax = 6(mod to) 

in which a, b, to are fixed integers with to > 0. By a solution of the congruence we shall 
mean an integer x = xi for which to \ (ax t — b). Now if Xi is a solution of (b) so that 
to | (axi — b) then, for any k G I, m \ (a(xi + km) — b) and x% + km is another solution. 
Thus, if X\ is a solution so also is every other element of the residue class [xi] modulo m. 
If then the linear congruence (b) has solutions, they consist of all the elements of one or 
more of the residue classes of I/(m). 

Example 13: (a) The congruence 2* = 3(mod 4) has no solution since none of 2 • 0 — 3, 2 • 1 — 3, 

2*2 — 3, 2*3-3 has 4 as a divisor. 

(6) The congruence 3sc = 2(mod 4) has 6 as a solution and, hence, all elements of 
[2] €= 1/(4) as solutions. There are no others. 

(c) The congruence 6x = 2(mod 4) has 1 and 3 as solutions. Since 3 # l(mod 4), 
we shall call 1 and 3 incongruent solutions of the congruence. Of course, all ele¬ 
ments of [1], [3] 6 1/(4) are solutions. There are no others. 

Returning to (b), suppose (a, m) = 1 = sa + tm. Then b = bsa + btm and Xi = bs is 
a solution. Now assume x% # a;i(mod m) to be another solution. Since ax i = 5(mod m) 
and axi = 6(mod to), it follows from the transitive property of = (mod to) that axi = 
az 2 (mod to ). Then m\a(xi-X2) and Xi = a; 2 (mod to ) contrary to our assumption. Thus, 

(b) has just one incongruent solution, say x\, and the residue class [* 1 ] G I/(m), also called 
a congruence class, includes all solutions. 

Next, suppose that (a, to) = d = sa 4- tm, d > 1. Since a = aid and to = mid, it 
follows that if (b) has a solution x = x\ then ax 1 — b = mq = m\dq and so d [ b. Con¬ 
versely, suppose that d = (a, to) is a divisor of b and write b = bid. By Theorem IX, 
Page 53, any solution of (5) is a solution of 

(c) aix = &i(mod TOi) 

and any solution of (c) is a solution of (b). Now ( 01 , TOi) = 1 so that (c) has a single 
incongruent solution and, hence, (b) has solutions. We have proved the first part of 

Theorem XI. The congruence ax = 6(mod to) has a solution if and only if d = (a, to) 
is a divisor of b. When d \ b, the congruence has exactly d incongruent 
solutions (d congruence classes of solutions). 

To complete the proof, consider the subset 

S = (Xi, X\ + toi, Xi + 2toi, xi + 3toi, ..., xi + (d — l)TOi} 

of [* 1 ], the totality of solutions of aa = 6i(mod TOi). We shall now show that no two 
distinct elements of S are congruent modulo to (thus, (6) has at least d incongruent solu¬ 
tions) while each element of [zi] -S is congruent modulo to to some element of S (thus, 
(6) has at most d incongruent solutions). 

Let Xi + stoi and Xi + tmi be distinct elements of S. Now if Xi + snu = Xi + fTOi(mod to) 
then to | (s - t)TOi; hence, d\(s-t) and s = t, a contradiction of the assumption s ¥= t. 
Thus, the elements of S are incongruent modulo to. Next, consider any element of 
[a?i] — S, say Xi + (qd + r)m x where q — 1 and 0 — r < d. Now Xi + (qd + r)mi = 
x\ + rTOi + qm = Xi + rTOi(mod to) and Xi + rmi G S. Thus, the congruence (b), with 
(a, to) = d and d \ b, has exactly d incongruent solutions. 


See Problem 14. 
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POSITIONAL NOTATION FOR INTEGERS 
It is well known to the reader that 


827,016 = 8 • 10 5 + 2 • 10 4 + 7-10 3 + 0-10 2 + 1-10 + 6 

What is not so well known is that this representation is an application of the congruence 
properties of integers. For, suppose a is a positive integer. By the division algorithm, 
a = 10 • q 0 + r 0 , 0 — r 0 < 10. If qo = 0, we write a = r 0 ; if qo > 0, then q 0 = 10 • qi + n, 
0 — ri < 10. Now if <71 = 0, then a = 10 • n + r 2 and we write a = rir 0 ; if qi > 0, then 
qi = 10 • q 2 + r 2 , 0 — r 2 < 10. Again, if g 2 = 0, then a = 10 2 • r 2 +10 • r\ + r 0 and we 
write a = r 2 nro; if <72 > 0, we repeat the process. That it must end eventually and we 
have 

a = 10 *-r» + 10 s-1 - r s -i + ••• + 10-ri + r 0 = r s r s -1 • • -nro 

follows from the fact that the q ’s constitute a set of decreasing non-negative integers. 
Note that in this representation the symbols n used are from the set {0,1,2,3, . . .,9} of 
remainders modulo 10. (Why is the representation unique?) 

In the paragraph above, we chose the particular integer 10, called the base, since this 
led to our system of representation. However, the process is independent of the base and 
any other positive integer may be used. Thus, if 4 is taken as base, any positive integer 
will be represented by a sequence of the symbols 0,1,2,3. For example, the integer 
(base 10) 155 = 4 3 - 2 + 4 8 -l + 4-2 + 3 = 2123 (base 4). 

Now addition and multiplication are carried out in much the same fashion, regardless 
of the base; however, new tables for each operation must be memorized. These tables 
for base 4 are: 


tf 0 12 3 

0 0 12 3 

1 1 2 3 10 

2 2 3 10 11 

3 3 10 11 12 


and 


• 0 12 3 

0 0 0 0 0 

10 12 3 

2 0 2 10 12 

3 0 3 12 21 


Table 5-3 


Table 5-4 


See Problem 15. 


Solved Problems 

1. Prove: If a \ b and a | c, then a | ( bx + cy) where x,y G 1. 

Since a \ b and a \ c, there exist integers s, t such that 6 = as and e = at. Then bx + cy = 
asx + aty — a(sx + ty) and a | (bx + cy). 


2. Prove: If a \ b and 6 += 0, then |6| — |a|. 

Since a \ b, we have b — ac for some c G I. Then |6| = |a| • |c| with |c| — 1. Since |c| — 1, it 
follows that |o| • |c| ^ |a|, that is, |6| - |a). 
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3. Prove: If a \ b and b | a, then b = a or b = -a. 

Since a | 6 implies a #0 and 6 | a implies b ¥* 0, write b — ae and a — bd, where e,d G /. Now 
a • 6 = ( bd)(ac ) = abed and, by the Cancellation Law, 1 = cd. Then by Problem 16, Chapter 4, 
c = 1 or —1 and 6 = ac = a or —a. 


4. Prove: The number of positive primes is infinite. 

Suppose the contrary, i.e., suppose there are exactly n positive primes Pi, P2< P3, • • • > Pn> written 
in order of magnitude. Now form a = Pj • Vi * P3 ’ • • • ' Pn ar) d consider the integer a + 1. Since 
no one of the p’s is a divisor of a +1, it follows that a + 1 is either a prime > p„ or has a prime 
> p n as factor, contrary to the assumption that p„ is the largest. Thus, there is no largest positive 
prime and their number is infinite. 


5. Prove the Division Algorithm: For any two non-zero integers a and b, there exist 
unique integers q and r such that 

a = bq + r, 0 ^ r < |6| 

Define S = {a - bx : x G /}. If b < 0, i.e., b * -1, then 6 • |a| ^ - |«l “ * and a ~ 6 * W “ °‘ 
If 6 > 0, i.e., b ^ 1 , then b • (- |a|) * - \a\ * a and a-b(- |a|) *= 0. Thus, S contains non-negative 
integers; denote by r the smallest of these (r 2= 0) and suppose r = a-bq. Now if r ^ \b\, then 
r — |b| 2= 0 and r-|6| = a-bq-\b\ = a-(q+l)b < r or a-(q-l)b < r, contrary to the 
choice of r as the smallest non-negative integer G S. Hence, a < |b|. 

Suppose we should find another pair q' and r' such that 

a = bq' + r', 0 - r' < |6| 

Now bq' + r' = bq + r or b(q' - q) = r - r' implies 6 1 (»— r') and, since \r-r'\ < |6|, then 
r -r' = 0; also q' - q = 0 since 6^0. Thus, r' = r, q' = q, and q and r are unique. 


6 . 


Find (389,167) and express it in the form 

From 

389 = 167 •2 + 65 
167 = 66-3 + 2 
55 = 2-27 + 1 
2 = 1-2 


389m + 167n. 

We find 
1 = 66-2-27 
= 66-82 - 167-27 
= 389-82 - 167-191 


Thus, (389,167) = 1 = 82 • 389 + (—191)(167). 


7. Prove: If c\ab and if (a, c) = 1, then c | b. 

From 1 = ma + ne, we have b = mab + neb. Since e is a divisor of mab + neb, it is a divisor 
of b and c | b as required. 


Prove: If (a, s) = (b, s) = 1, then {ab, s) = 1. 

Suppose the contrary, i.e., suppose (06, s) = d > 1 and let d = (ab,s) = mab + ns. Now 
d | ab and d \ s. Since (a, s) = 1 , it follows that d/ a; hence, by Problem 7 , d \ b. But this contradicts 
(6,8) = 1; thus, (a.6, s) = 1. 


9. Prove: If p is a prime and if p \ ab, where a,b G I, then p\a or p\b. 

If p I a we have the theorem. Suppose then that p/ a. By definition, the only divisors of p are 
±1 and ±p; then (p,o) = 1 = mp + na for some m,nG 7 by Theorem II. Now 6 - mpb + nab 
and, since p | (mpb + nab), p | 6 as required. 
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10. Prove: Every integer a > 1 has a unique factorization (except for order) 
a = pi • j>* • pa • ... • p» 
into a product of positive primes. 

If a is a prime, the representation in accordance with the theorem is immediate. Suppose then 
that a is composite and consider the set S - {*: x > 1, * | a}. The least element » of S has no 
positive factors except 1 and s; hence s is a prime, say p,, and 

“ = Pi * *i, hi > 1 

Now either 6, is a prime, say p 2 , and a — p, • p 2 or 6,, being composite, has a prime factor p 2 and 
« — Pi " P2 ' h 2 , b 2 > 1 

A repetition of the argument leads to a = p, • p 2 • p 3 or 

and so on “ = Pi ‘ Pi ‘ Pa' f> 3 , b 3 > 1 

Now the elements of the set B = {hi, h 2 , h 3 ,...} have the property h, > h 2 > h 3 > • • •; hence, 
B has a least element, say h„, which is a prime p„ and 


« = Pi • Ps * Pa ' • • ■ ' Pn 

as required. 

To prove uniqueness, suppose we have two representations 


“ - *»i * Pi * Pa * • • • * P» = 9i * «2 * 93 ' • • • • 9m 
Now q, is a divisor of Pi * P 2 • p 3 * ... *p n ; hence, by Theorem V', q, is a divisor of some one of the 
p’s, say p t . Then q, = p u since both are positive primes, and by M 4 of Chapter IV, 
p 2 • p 3 • . .. • p„ = q 2 • q 3 • . .. • q m 

After repeating the argument a sufficient number of times, we find that m = n and the factorization 
is unique. 


11. Find the least positive integers modulo 5 to which 19, 288, 19 • 288 and 19 3 • 288 2 are 
congruent. 

We find 

19 = 5*3 + 4; hence 19 = 4(mod 5). 

288 = 5 • 57 + 3; hence 288 = 3(mod 5). 

19*288 = 5(- • •) + 12; hence 19*288 s 2(mod 6). 

193.2882 = 5(- • •) + 4» • 3* = 5(- • •) + 576; hence 19 3 • 2882 s l(mod 5). 


12. Prove: Let (c, m) = d and write m = w,d. If ca = cb(mod m), then o = 6(mod m i) 
and conversely. 

Write c = c,d so that (c„w,) = 1. If m|c(a-h), that is, if | c,d(a-6), then 

OTi|e,(a-6) and, since (c„to,) = 1, mi \(a-b) and a = b(mod mi). 

For the converse, suppose a = 6(mod to,). Since to, | (a - b), it follows that wii | c^a - b) and 
Wid | c,d(o — 6). Thus, to | c(a — b) and ca = c6(mod to). 


13. Show that, when a,b,p > 0 G /, (a + 6) p = a p + 6 p (mod p). 

By the binomial theorem, (a + b)» - a? + p(- • •) + and the theorem is immediate. 


14. Find the least positive incongruent solutions of: 

(a) 13* = 9(mod 25) (c) 259* s 5(mod 11) (e) 222* = 12(mod 18) 

(b) 207* s 6(mod 18) ( d) 7* = 5(mod 256) 

(a) Since (13, 25) = 1, the congruence has, by Theorem XI, a single incongruent solution. 
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Solution I. If x x is the solution, then it is clear that x, is an integer whose unit’s digit is 
either 3 or 8; thus x t G {3,8,13,18,23}. Testing each of these in turn, we find x x = 18. 

Solution II. By the greatest common divisor process we find (13, 25) = 1 = —1 • 25 + 2 • 13. 
Then 9 - - 9 • 25 + 18 • 13 and 18 is the required solution. 

(6) Since 207 = 18 • 11 + 9, 207 = 9(mod 18), 207* = 9*(mod 18) and, by transitivity, the given 

congruence is equivalent to 9* = 6(mod 18). By Theorem IX this congruence may be reduced to 
3* = 2(mod 6). Now (3,6) = 3 and 3/2; hence, there is no solution. 

(c) Since 259 = 11 • 23 + 6, 259 = 6(mod 11) and the given congruence is equivalent to 6x = 5(mod 11). 

This congruence has a single incongruent solution which by inspection is found to be 10. 

(d) Using the greatest common divisor process, we find (256, 7) = 1 = 2 • 256 + 7(—73); thus, 

5 = 10 • 256 + 7(—365). Now -365 = 147(mod 256) and the required solution is 147. 

(e) Since 222 = 18 • 12 4- 6, the given congruence is equivalent to 6* = 12(mod 18). Since 

(6, 18) = 6 and 6 112, there are exactly 6 incongruent solutions. As shown in the proof of 
Theorem XI, these 6 solutions are the first 6 positive integers in the set of all solutions of 
x = 2(mod 3), that is, the first 6 positive integers in [2] € //(mod 3). They are then 
2,5,8,11,14,17. 

15. Write 141 and 152 with base 4. Form their sum and product, and check each result. 
141 = 43.2 + 4 2 • 0 + 4 • 3 + 1; the representation is 2031. 

152 = 4 3 • 2 + 4 2 • 1 + 4 • 2 + 0; the representation is 2120. 

Sum. 

1 + 0 = 1; 3 + 2 = 11, we write 1 and carry 1; 1 + 1 + 0 = 2; 2 + 2 = 10 
Thus, the sum is 10211, base 4, and 293, base 10. 

Product. 

Multiply by 0: 

Multiply by 2: 2 • 1 = 2; 2 • 3 = 12, write 2 and carry 1; etc 
Multiply by 1: 

Multiply by 2: 

The product is 11032320, base 4, and 21432, base 10. 


0000 

10122 

2031 

10122 

11032320 


Supplementary Problems 

16. Show that the relation (|) is reflexive and transitive but not symmetric. 

17. Prove: If a \ b, then -a | 6, a | -b, and -a \ -b. 

18. List all the positive primes (a) < 50, (5) < 200. 

Ans. (a) 2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47 

19. Prove: If a = b • q + r, where a,b,q,r G /, then any common divisor of a and b also divides r 
while any common divisor of 6 and r also divides a. 

20. Find the greatest common divisor of each pair of integers and express it in the form of Theorem II: 

(o) 237, 81 A ms. 3 = 13 • 237 + (-38) • 81 

(6) 616, 427 Ans. 7 = — 9 • 616 + 13 • 427 

(c) 936, 666 Ams. 18 = 5 • 936 + (—7) • 666 

(d) 1137,419 Ams. 1 = 206-1137 + (-559) -419 


21. Prove: If s # 0, then (sa, sb) — |s| • (a, b). 
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22 . 

23. 

24. 

25. 


27. 


29. 


31. 


34. 

35. 

36. 

37. 

38. 

39. 


Prove: (a) If a | 8 , 6 | s and (a, 6) = 1, then ab | s. 

( b) If m = dm, and if m \ am lt then d \ a. 

Prove: If p, a prime, is a divisor of a • 6 • c, then p | a or p\b or p | c. 

The integer e = [a, 6] is called the least common multiple of the positive integers a and b when 
(1) a | e and 61 e, (2) if o | * and b\x then e\x. 

Find: (a) [3,7], (6) [3,12], (c) [22,715]. Ans. (a) 21, (6) 12, (c) 1430 

(a) Write the integers a — 19,500 and b = 54,450 as products of positive primes. 

(b) Find d = (a, b) and e = [a, 6]. 

(c) Verify d • e = a • b 

(d) Prove the relation in (c) when a and b are any positive integers. 

Ans. (b) 2 • 3 • 5 2 ; 2 2 • 3 2 • 5 3 • ll 2 • 13 


Prove: If m > 1, m /o, m jfb, then m | (o — 6) implies a — wig, = r = 6 — mq 2 , 0 < r < m, and 
conversely. 


Find all solutions of: 


(a) 4x = 3(mod 7) 

(5) 9* = 11 (mod 26) 

(c) 3x + 1 = 4(mod 5) 

(d) 8* = 6 (mod 14) 


(e) 153x s 6(mod 12) 

(f) x + 1 = 3(mod 7) 

(o) Sx s 6(mod 422) 

(h) 363* = 345(mod 624) 


Ans. (a) [6], (6) [7], (c) [1], (d) [6], [13], (e) [2], [6], [10], (/) [2], (g) [159], [370], (h) [123], [331], [539] 


Prove: Theorems V, VI, VII, VIII. 


Prove: If a = 6(mod m) and c = 6(mod m), then 
Page 52. 


= c(mod m). See Examples 10(a), (6), (c), 


(а) Prove: If a + x = b + x(mod m), then a = 6(mod to). 

(б) Give a single numerical example to disprove: If ax = bx( mod to), then ax = 6(mod to). 

(c) Modify the false statement in (6) to obtain a true one. 

(а) Interpret a = 6(mod 0). 

(б) Show that every x&l is a solution of ax = 6(mod 1). 

(a) Construct addition and multiplication tables for 1/(5). 

(b) Use the multiplication table to obtain 3 2 = 4(mod 5), 3 4 = l(mod 5), 3 8 = l(mod 5). 

(c) Obtain 3 2 58 = l(mod 5), 3™ = 4(mod 5), 3'™ = l(mod 5). 

Construct addition and multiplication tables for 1/(2), 1/(6), 1/(7), 1/(9). 

Prove: If [s] e//( to) and if a, 5 e [«], then a = 6(mod to). 

Prove: If [s], [t] G //(m) and if a G [s] and b G [t], then a = 6(mod to) if and only if [s] = [t]. 

Express 212 using in turn the base (a) 2, (b) 3, (c) 4, (d) 7, and (e) 9. 

Ans. (a) 11010100, (b) 21212, (c) 3110, (d) 422, (e) 265 

Express 89 and 111 with various bases, form the sum and product, and check. 

Prove the first part of the Unique Factorization Theorem using the induction principle stated in 
Problem 27, Chapter 3, Page 37. 
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The Rational Numbers 

THE RATIONAL NUMBERS 

The system of integers has an obvious defect in that, given integers m ¥= 0 and s, the 
equation mx — s may or may not have a solution. For example, Bx — 6 has the solution 
x = 2 but 4x = 6 has no solution. This defect is remedied by adjoining to the integers 
additional numbers (common fractions) to form the system Q of rational numbers. The 
construction here is, in the main, that used in Chapter 4. 

We begin with the set of ordered pairs 

K = / x (/ — {0}) = {(s,m): sGl, mGl-{ 0} } 

and define the binary relation ~ on all (s, m), (t, n) G K by 

(s,m) ~ ( t,n) if and only if sn = mt 

(Note carefully that 0 may appear as first component but never as second component in 
any ( s,m).) 

Now ~ is an equivalence relation (prove it) and thus partitions K into a set of 
equivalence classes . ,, , . 

where [«, m) = {(a, b) : (a, 6) G K, (a, b) ~ (s, m) } 

We shall call the equivalence classes of ^ rational numbers and in the following sections 
indicate that $ is isomorphic to the system Q as we know it. 

ADDITION AND MULTIPLICATION 

Addition and Multiplication on § will be defined respectively by 

(i) [s,m] + [t,n ] = [(sn + mt),mn] 

(ii) [s, m] • [t, n] = [st, mn] 

These operations, being defined in terms of well-defined operations on integers, are 
(see Problem 1) themselves well-defined. 

We now define two special rational numbers 

zero: [0 ,m] 0 one: [to, to] <-» 1 

and the inverses 

(additive): — [*, to] = for each [s, to] G ^ 

(multiplicative): [s,to]"‘ = [to,s] for each [s, to] G £ when s /- 0. 

By paralleling the procedures in Chapter 4, it is easily shown that addition and 
multiplication obey the laws Ai-Ae, M 1 -M 5 , D 1 -D 2 as stated for integers. 

A property of $, but not of I, is: 

M 8 : For every x ¥■ 0 G § there exists a multiplicative inverse x~ l G $ such 
that X' x~ l = x~ l ‘x = 1. 
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By Theorem IV, Chapter 2, the inverses defined in M e are unique. 

In Problem 2, we prove 

Theorem I. If x and y are non-zero elements of 3 then (x-y)~' = t/ -1 ** -1 . 
SUBTRACTION AND DIVISION 


Subtraction and division are defined on 3 by 


(iii) 

and 

x-y = x 4- (-y) 

for all x,y G 3 

(iv) 

respectively. 

x + y = x-y - 1 

for all x G 3 , y ¥= 0 G 3 


These operations are neither associative nor commutative (prove this). However, as 
on I, multiplication is distributive with respect to subtraction. 


REPLACEMENT 

The mapping [t, 1] G 3 +* t G I 

is an isomorphism of a certain subset of 3 onto the set of integers. We may then, when¬ 
ever more convenient, replace the subset 3 * = {[i,l] : [i,l] e $} by /. To complete the 
identification of 3 with Q, we have merely to replace 

and, in particular, [s, to] by s/m. X V by X ^ V 


ORDER RELATIONS 

An element x G Q, i.e., x <-+ [s, to] g 3 , is called positive if and only if s • to > 0. 
The subset of all positive elements of Q will be denoted by Q ' and the corresponding 
subset of $ by 3 + . Similarly, [ s,m ] is called negative if and only if s • to < 0. The 
subset of all negative elements of Q will be denoted by Q- and the corresponding subset 
°f $ by 3 • Since, by the Trichotomy Law of Chapter 4, either s • to > 0, s ■ to < 0 or 
s • to = 0, it follows that each element of 3 is either positive, negative or zero. 

The order relations < and > on Q are defined as follows: 

For each x,y G Q, 

x <y if and only if x — y < 0 
x>y if and only if x - y > 0 

These relations are transitive but neither reflexive nor symmetric. 

Q also satisfies 

The Trichotomy Law: If x,y G Q, one and only one of 

holds. (a) X = y (b) x < y (c ) x>y 


REDUCTION TO LOWEST TERMS 

Consider any arbitrary [s,m}&3 with s ^ 0. Let the (positive) greatest common 
divisor of s and w be d and write s = dsi, m = dm u Since (s, to) ~ (s h TOi), it follows that 
[s, to] = [si, TOi], i.e., s/to = si/toi. Thus, any rational number ^ 0 can be written 
uniquely in the form a/b where a and b are relatively prime integers. Whenever s/to has 
been replaced by a/b, we shall say that s/to has been reduced to lowest terms. Hereafter, 
any arbitrary rational number introduced in any discussion is to be assumed reduced to 
lowest terms. 
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In Problem 3 we prove: 

Theorem II. If x and y are positive rationals with x < y, then 1/x > 1/y. 

In Problems 4 and 5, we prove: 

The Density Property. If x and y, with x < y, are two rational numbers, there exists a 
rational number z such that x <z <y. 

and 

The Archimedean Property. If x and y are positive rational numbers, there exists a posi¬ 
tive integer p such that px > y\ 


DECIMAL REPRESENTATION 

Consider the positive rational number a/6 in which b > 1. Now 
a = qob + ro 0 — To < b 

&n< * 10ro = qib + ri 0 — n < b 

Since ro <6 and, hence, qib + ri = 10ro < 106, it follows that qi < 10. If ri = 0, then 
r ° = Tfib> a = q 0 b+^b, and = q 0 + qi/10. We write a/b = qo-Qi and call q 0 .qi the 
decimal representation of a/b. If n ¥> 0, we have 

10 n = g 2 & + r 2 0 — r 2 < 6 

in which <10. If r 2 = 0, then r, = |§6 so that n = ^b+^b and the decimal 
representation of a/b is qo.qw, if r 2 = r,, the decimal representation of a/b is the repeat¬ 
ing decimal g 0 . 9 i 92 <M 2 .; if r 2 ^0,ri, we repeat the process. 

Now the distinct remainders r 0 , r u r 2 ,... are elements of the set {0, 1, 2, 3.6-1} 

of residues modulo 6 so that, in the extreme case, n must be identical with some one of 
r 0 ,n,r 2 , .,r b -i, say r c , and the decimal representation of a/b is the repeating decimal 

qo.q\q 2 qa -<7b-i 9c+i 9c+2 • • • 9b-i9c+i9c+ 2 - qo-i - 

Thus, every rational number can be expressed as either a terminating or a repeating decimal. 
Example 1: (a) 5/4 = 1.25 

(, b) 3/8 = 0.375 

(c) For 11/6, we find n = x . 6 + 5; g 0 = 1, r 0 = 6 

10 • 5 = 8 • 6 + 2; qi = 8, r t = 2 

10 • 2 = 3*6 + 2; q 2 = 3, r 2 = 2 = 

and 11/6 = 1.833333.... 


( d) For 25/7, 


find 

25 

10*4 

10*5 

10*1 

10*3 

10*2 

10*6 


3*7 + 4; 
5*7 + 5; 
7*7 + 1; 
1*7 + 3; 
4*7 + 2; 
2*7 + 6; 
8*7 + 4; 


90 _ r o — 4 

91 = 5, »T = 5 
« 2 = 7, r 2 = 1 
qr 3 = 1, r 3 = 3 
94 = 4, r 4 = 2 
9s = 2, r 5 = 6 


and 25/7 = 3.571428 571428.... 
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Conversely, it is clear that every terminating decimal is a rational number. For 
example, 0.17 = 17/100 and 0.175 = 175/1000 = 7/40. 

In Problem 6, we prove 

Theorem III. Every repeating decimal is a rational number. 

The proof makes use of two preliminary theorems: 

(i) Every repeating decimal may be written as the sum of an infinite geometric pro¬ 
gression. 

(ii) The sum of an infinite geometric progression whose common ratio r satisfies |r| < 1 
is a finite number. 

A discussion of these theorems can be found in any College Algebra book. 


Solved Problems 


1. Show that addition and multiplication on § are well-defined. 

Let [a,6] = [s,m] and [c,d] = [t,»]. Then (a, 6) ~ (*, m) and (c,d) ~ (t,n), 
and cn = dt. Now 


[«>*] + [«,<*] 


and addition is well-defined. 


[(od + be), 6d] = [(ad + bc)mn, bd • mn] 

[(am • dn + cn • 6m), bd • mn] 

[(6s • dn + dt' bm), bd • mn] 

[6d(sw + tm), bd • mn] 

[sn + tm.rnn] = [s,m] + [t,n] 


that am = bs 


Also [a, 6] • [c, d] 


and multiplication is well-defined. 


[ae, 6d] = [ac • mn, 6d • mw] 

[am • cn, bd • mn] = [6s • dt, bd • mn] 

[6d • st, bd • mn] = [st, mn] 

[s,m] • [t,n] 


2. Prove: If x,y are non-zero rational numbers then (x-y)~ i = y~ 1 -x~ 1 . 

Let x [s,m] and y [ t,n ], so that x~ l ** [m,s] and y~ l *-> [n, t]. Then * • y *+ [s,m] • [t,n] = 

[st,mn] and (x-y)- 1 <h> [mn,st] = [n, t] • [m, s] y- 

3. Prove: If x and y are positive rationals with x < y, then \/x > 1/y. 

Let x *-* [s,m] and y <-> [ t,n ]; then sm >0, tn> 0, and sn < mt. Now, for 1/x = [m, s] 

and l/y [n, t\, the inequality mt > sn implies 1/x > 1/y, as required. 

4. Prove: If x and y, with x < y, are two rational numbers, there exists a rational 
number z such that x <z <y. 

Since x < y, we have 

2x = x + x < x + y and x + y < y + y — 2y 
Then 2x < x + y < 2y 

and, multiplying by x < £(x + y) < y. Thus, £(x + y) meets the requirement for z. 
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5. Prove: If x and y are positive rational numbers, there exists a positive integer p 

such that px > y. ^ , 

Let x [s,m\ and y *+ [t,n\, where s, m, t, n are positive integers. Now px > y if and only 

if psn > mt. Since sri-1 and 2sn > 1, the inequality is certainly satisfied if we take p - 2mt. 


Prove: Every repeating decimal represents a rational number. 

Consider the repeating decimal 

x.yzdef def. ... = x.yz + 0.00 def + 0.00000 def + 

Now x.yz is a rational fraction since it is a terminating decimal, while O.OOde/ + 0.00000 def + 
is an infinite geometric progression with first term a = 0.00 def, common ratio r = 0.001, and sum 

c — _“_ = 0.00 de f_ _ <fe/ a rational fraction 

& - 1 - r 0.999 99900 

Thus the repeating decimal, being the sum of two rational numbers, is a rational number. 


7. Express (a) H with base 4 > ( b ) * with base 5 * 

(a) 27/32 = 3(£) + 3/32 = 3(£) + l(i> 2 + 1/32 = 3<i> + 1(±) 2 + 2(1) 3 

The required representation is 0.312. 

(b) 1/3 = 1(*) + & = m + 3(i) 2 + 1/75 

= !(i> + 3 (£) 2 + Ki > 3 + 2/376 

= l(£) + 3<£) 2 + Ki) 3 + 3 <i> 4 + 1/1875 
The required representation is 0.131313. 


Supplementary Problems 

8. Verify: (a) [s,m] + [0,«] = [s,m] (e) [s,m] + [-s,w] = [0,w] = [s,m] + [s.-m] 

(b) [s, m] • [0, n] = [0,«] (d) [s,w] • [w.s] = [n,«] 

9. Restate the laws A,-A 6 , M r M 5 , D r D 2 of Chapter 4 for rational numbers and prove them. 

10. Prove: (a) ( fj + is closed with respect to addition and multiplication. 

(6) If [s,«t] £ ^+, so also does [s.m]- 1 . 

11. Prove: (a) is closed with respect to addition but not with respect to multiplication. 

(b) If [s,m] e so also does [s.m]- 1 . 

12. Prove: If x,y £ Q and x • y = 0, then x = 0 or y = 0. 

13. Prove: If x,y e Q, then (a) -(* + v) = -x-y and (6) -(-*) = x. 

14. Prove: The Trichotomy Law. 

15. If x.y.z e Q, prove: 

(а) x + z < y + z if and only if x < y ; 

(б) when z > 0, xz < yz if and only if x < y; 

(c) when z < 0, xz < yz if and only if x > y. 

16. If w,x,y,z S Q with in (o) and (b), and xyz #0 in (c), prove: 

(a) (w + x) ± (y + z) = (wz ± xy) + xz 

(b) (w + x)-(y + z) = wy + xz 

(c) (w + x) +(y + z) = wz + xy 

17. Prove: If a ,b£Q + and a < b, then a* < ab < 6 2 . What is the corresponding inequality if 
a,b€ Q-? 
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INTRODUCTION 

Chapters 4 and 6 began with the observation that the system X of numbers previously 
studied had an obvious defect. This defect was remedied by enlarging the system X. In 
doing so, we defined on the set of ordered pairs of elements of X an equivalence relation, 
etc., etc. In this way we developed from N the systems I and Q satisfying N c I C Q. 
For what is to follow, it is important to realize that each of the new systems I and Q has 
a single simple characteristic, namely, 

/ is the smallest set in which, for arbitrary m,sEN, the equation m + x = s 
always has a solution. 

Q is the smallest set in which, for arbitrary integers m¥= 0 and s, the equation 
mx — s always has a solution. 

Now the situation here is not that the system Q has a single defect; rather, it is there 
are many defects and these so diverse that the procedure of the previous chapters will not 
remedy all of them. We mention only two: 

(1) The equation x 2 = 3 has no solution in Q. For, suppose the contrary, and assume 

that the rational a/6, reduced to lowest terms, be such that (a/6) 2 = 3. Since 

a 2 = 36 2 , it follows that 31 a 2 and, by Theorem V, Chapter 5, that 3 | a. Write a = 3a,; 

then 3af = 6 2 so that 3 j 6 2 and, hence, 316. But this contradicts the assumption 
that a/b was expressed in lowest terms. 

(2) The circumference c of a circle of diameter d E Q is not an element of Q, i.e., in 

c = nd, it £ Q. Moreover, ir 2 & Q so that -t? is not a solution of x 2 = q for any q E Q. 

(In fact, it satisfies no equation of the form ax" + 6x n ~‘ + • • ■ + sx + f = 0 with 

a, 6, . . ., s, t E Q.) 

The method, introduced in the next section, of extending the rational numbers to the 
real numbers is due to the German mathematician R. Dedekind. In an effort to motivate 
the definition of the basic concept of a Dedekind cut or, more simply, a cut of the rational 
numbers, we shall first discuss it in non-algebraic terms. 

Consider the rational scale of Fig. 7-1, that is, 
a line L >n which the non-zero elements of Q are 
attached to points at proper (scaled) distances from 
the origin which bears the label 0. For convenience, 
call each point of L to which a rational number is 
attached a rational point. (Not every point of L 
is a rational point. For, let P be an intersection of 
the circle with center at 0 and radius 2 units with 
the line parallel to and 1 unit above L. Then let the 
perpendicular to L through P meet L in T; by (1) 
above, T is not a rational point.) Suppose the line L 
to be cut into two pieces at some one of its points. 

There are two possibilities: 



Fig. 7-1 
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(a) The point at which L is cut is not a rational point. Then every rational point of L 
is on one but not both of the pieces. 

(b) The point at which L is cut is a rational point. Then, with the exception of this 
point, every other rational point is on one but not both of the pieces. Let us agree 
to place the exceptional point always on the right-hand piece. 

In either case, then, the effect of cutting L at one of its points is to determine two 
non-empty proper subsets of Q. Since these subsets are disjoint while their union is Q, 
either defines the other and we may limit our attention to the left-hand subset. This 
left-hand subset is called a cut and we now proceed to define it algebraically, that is, 
without reference to any line. 


DEDEKIND CUTS 

By a cut C in Q we shall mean a non-empty proper subset of Q having the additional 
properties: 

(i) if c G C and a G Q with a < c, then aGC. 

(ii) for every c G C there exists b GC such that b > c. 

The gist of these properties is that a cut has neither a least (first) element nor a 
greatest (last) element. There is, however, a sharp difference in the reasons for this 
state of affairs. A cut C has no least element because, if cGC, every rational number 
a < c i s an element of C. On the other hand, while there always exist elements of C 
which are greater than any selected element c G C, there also exist rational numbers 
greater than c which do not belong to C, i.e., are greater than every element of C. 

Example 1: Let r be an arbitrary rational number. Show that C(r) = (a . a G Q, a < r} is a cut. 

Since Q has neither a first nor last element, it follows that there exists r t eQ 
such that r, < r (thus, C(r) ^ 0) and r 2 €Q such that r 2 > r (thus, C(r) ¥>Q). 
Hence, C(r) is a non-empty proper subset of Q. Let cGC(r), that is, e < r. Now 
for any a G Q such that a < e, we have a < c < r; thus, a G C(r) as required in (i). 
By the Density Property of Q there exists deQ such that c < d < r, then d > c 
and d G C(r), as required in (ii). Thus, C(r) is a cut. 

The cut defined in Example 1 will be called a rational cut or, more precisely, the cut 
at the rational number r. For an example of a non-rational cut, see Problem 1. 

When C is a cut, we shall denote by C' the complement of C in Q. For example, if 
C = C(r) of Example 1, then C' = C'(r) = {a': a' G Q, a' ^r}. Thus, the complement 
of a rational cut is a proper subset of Q having a least but no greatest element. Clearly, 
the complement of the non-rational cut of Problem 1 has no greatest element; in Problem 2, 
we show that it has no least element. 

In Problem 3, we prove 
Theorem I. If C is a cut and r G Q, then 

(a) D — {r + a: aGC) is a cut and (b) D' = {r + a': a' GC'} 

It is now easy to prove 
Theorem II. If C is a cut and rGQ + , then 

(a) E - [ra: aGC} is a cut and (6) E’ - [ra’: a' G C'} 

In Problem 4, we prove 

Theorem III. If C is a cut and r G Q + , there exists bGC such that r + bGC\ 


CHAP. 7] 


THE REAL NUMBERS 


67 


POSITIVE CUTS 

Denote by X the set of all cuts of the rational numbers and by X + the set of all cuts 
(called positive cuts) which contain one or more elements of Q + . Let the remaining cuts 
in X be partitioned into the cut 0, i.e., 0 = C(0) = {a: aGQ], and the set X~ of all 
cuts containing some but not all of the elements of Q~. For example, C(2) G X + while 
C(—5) G X~■ We shall, for the present, restrict our attention solely to the cuts of X + for 
which it is easy to prove 

Theorem IV. If CGX + and r > 1 G Q + , there exists cGC such that rcGC'. 

Each C G X + consists of all elements of Q , 0, and see (Problem 5) an infinitude of 
elements of Q + . For each C G X + , define £ — {a: a G C, a> 0} and denote the set of 
all C s by Ji. For example, if C ~ C( 3) then £(3) = {a: aGQ, 0<a<3} and C may be 
written as C = C(3) = Q~ U {0} U £(3). Note that each C G X + defines a unique £ G Ji 
and, conversely, each £ G Ji defines a unique C GX + - Let us agree on the convention: 
when Ci G X + , then Ci = Q~ U {0} U £ r 

We define addition (+) and multiplication (•) on X + as follows: 

C1 + C2 = Q-u {0} U (£j + C 2 ) 

for each C t ,C 2 GX + 

Ci-Ct = Q~ U {0} U (£,• C 2 ) 

. \ Ct+C 2 = ( c , + c 2 : c G £j, c 2 G £ } 

where (i) J 

lc t -c 2 = ( c i * c 2 : C 1 G £,, c 2 G C 2 ) 

It is easy to see that both Ci + C 2 and Ci • C 2 are elements of X + • Moreover, since 
Ci + C 2 = {a: aGCi + C 2 , a > 0} 

and 

C t ‘C 2 = {a: aGCi-Ci, a > 0} 

it follows that Ji is closed under both addition and multiplication as defined in (i). 


Example 2: Verify: (a) C(3) + C(7) = C(10), (6) C(3) C(7) = C(21). 

Denote by C(3) and C(7) respectively the subsets of all positive rational numbers 
of C(3) and C(7). We need then only verify 

C(3) + C(7) = C(10) and C(3)*C(7) = C(21) 

(a) Let e t e C( 3) and c 2 G C(7). Since 0 < c 1 < 3 and 0 < c 2 < 7, we have 
0 < c, + c 2 < 10. Then c, + c 2 G C(10) and C(3) + C(7) C C(10). Next, suppose 
c 3 G C(10). Then, since 0 < e 3 < 10, 

0 < < 3 and 0 < < 7 

that is, ^jC 3 G C( 3) and ^c 3 G f( 7). Now c 3 = ~c 3 + ^c 3 ; hence, 
C(10) C C(3) + cm Thus, C(3) + C(7) = C(10) as required. 

(b) For cj and c 2 as in (a), we have 0 < Cj • c 2 < 21. Then c l • c 2 G C(21) and 
C(3) • C(7) C C(21). Now suppose c 3 G C(21) so that 0 < c 3 < 21 and 0 <C = 
9 < 1. Write q = q, • q 2 with 0 < q x < 1 and 0 < q 2 < 1. Then c 3 = 21 q = 
(39i)( 79 2 ) with 0 < 3g, < 3 and 0 < 7q 2 < 7, i.e., 3?! G C(3) and 7q 2 G C(7). 
Then C(21)CC(3)-C(7) and C(3) • C(7) = C(21) as required. 

The laws A1-A4, M1-M4, Di-D 2 in X + follow immediately from the definitions of addition 
and multiplication in X + and the fact that these laws hold in Q + and, hence, in Ji. 
Moreover, it is easily shown that C(l) is the multiplicative identity, so that Ms also holds. 
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MULTIPLICATIVE INVERSES 

Consider now an arbitrary C = Q U {0} Uf£ and define 
C~ l = {b: bGQ+, b < a 1 for some a G C'} 

In Problem 6 we prove: 

If C = Q~ U {0} U C then C' 1 — U {0} U £ _1 is a positive cut. 

In Problem 7 we prove: 

For each C G 7^ + , its multiplicative inverse is C ' G% +. 

At this point we may move in either of two ways to expand % + into a system in which 
each element has an additive inverse: 

(1) Repeat Chapter 4 using % + instead of N. 

(2) Identify each element of as the additive inverse of a unique element of 
We shall proceed in this fashion. 

ADDITIVE INVERSES 

The definition of the sum of two positive cuts is equivalent to 

C 1 + C 2 = {C1 + C2: c, GCi, C 2 GC 2 }, c u C2e% + 

We now extend the definition to embrace all cuts by 

C1 + C2 = {C1 + C2: Ci G Ci, C2GC2}, Ct.Ciex ( 1 ) 

Example 3: Verify C(3) + C(-7) = C(-4). 

Let c, + c 2 e C(3) + C(—7), where e, e C(3) and c 2 £ C(-7). Since Ci < 3 
and c 2 < —7, it follows that c 1 + e 2 < —4 so that c t + c 2 £ C(— 4). Thus, 
C(3) + C(—7) c C(—4). 

Conversely, let c 3 £ C(— 4). Then e 3 <—4 and —4 —c 3 =d£Q + . Now 
e 3 = -4 - d = (3 - $d) + (-7 - £d); then since 3 - € C(3) and -7 - $d £ C(-7), 

it follows that c 3 £ C(3) + C(- 7) and C(-4) C C(3) + C(-7). Thus, C(3) + C(-7) = 
C(— 4) as required. 

Now, for each C G%, define 

—C = {x : x G Q, x < — a for some a G C'} 

For C = C(— 3), we have — C = {x: x GQ, x < 3} since -3 is the least element of C'. 
But this is precisely C(3); hence, in general, 

-C(r) = C(-r) when r GQ 

In Problem 8 we show that — C is truly a cut, and in Problem 9 we show that —C is 
the additive inverse of C. Now the laws Ai-A 4 hold in %. 

In Problem 10 we prove 

The Trichotomy Law. For any C G %, one and only one of 

C = C( 0) C G % + -CG% + 

holds. 

MULTIPLICATION ON % 

For all C G%, we define 

C > (7(0) if and only if C G X + 

C < (7(0) if and only if —C G% + 
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and |C| = C when C — C(0) 

|C| = —C when C < (7(0) 

Thus, |q^C(0), that is, \C\ = (7(0) or |C|e*; + . 

For all Ci, C 2 G %, we define 

Ct • C 2 = C( 0) when Ci = C(0) or C 2 = (7(0) (2) 

■ Ci‘C 2 = |Ci|• \C 2 \ when C t >C( 0) and C 2 > C(0) or when Ci < (7(0) and C 2 < C( 0) 

Ci • C 2 = — (|Ci| • |C 2 |) when Ci > C(0) and C 2 < C(0) or when Ci < (7(0) and C 2 > (7(0) 
Finally, for all C ¥= (7(0), we define 

C-' = \C\~' when C > C(0) and C~' = -(l^l" 1 ) when C < C(0) 

It now follows easily that the laws Ai-A 6 , Mi-M e , D1-D2 (see Page 71) hold in 


SUBTRACTION AND DIVISION 

Paralleling the corresponding section in Chapter 6, we define for all Ci, C 2 G% 
Ci - C 2 = Ci + ( -C 2 ) 

and, when C 2 ¥• C( 0), 

Ci -5- C 2 = Ci-C, 1 


(3) 

(*) 


Note. We now find ourselves in the uncomfortable position of having two completely 
different meanings for Ci - C 2 . Throughout this chapter, then, we shall agree to consider 
Ci — C 2 and Ci D C' 2 as having different meanings. 


ORDER RELATIONS 

For any two distinct cuts Ci, C 2 G %, we define 

Ci < C 2 , also C 2 > Ci, to mean Ci-C 2 < (7(0) 

In Problem 11, we show 

Ci < C 2 , also C 2 > Ci, if and only if Ci C C 2 
There follows easily 

The Trichotomy Law, For any Ci,C 2 G %, one and only one of the following holds: 

(a) Ci = C 2 (b) Ci < C 2 (c) Ci > Ci 

PROPERTIES OF THE REAL NUMBERS 

Define %* = {C(r): C(r) G%, rG Q}. We leave for the reader to prove 
Theorem V. The mapping C{r) G %* -+ r G Q is an isomorphism of %* onto Q. 

The elements of % are called real numbers and, whenever more convenient, % will be 
replaced by the familiar R while A,B, ... will denote arbitrary elements of R. Now 
Q C R; the elements of the complement of Q in R are called irrational numbers. 

In Problems 12 and 13, we prove 

The Density Property. If A,B G R with A < B, there exists a rational number C(r) 
, such that A < C(r) < B. 

and 

The Archimedean Property. If A,B G R + , there exists a positive integer C(n) such 
that C(n) - A> B. 
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In order to state below an important property of the real numbers which does not hold 
for the rational numbers, we make the following definition: 

Let S ¥* 0 be a set on which an order relation < is well-defined, and let T be any 
proper subset of S. An element s G S, if one exists, such that s — t for every t G T 
(s~t for every t G T) is called an upper bound ( lower bound) of T. 

Example 4: (a) If S-Q and T = {-5,-1,0,1,3/2}, then S/2,2,102,... are upper bounds 

of T -while -5, -17/3, -100, ... are lower bounds of T. 

( b ) When S = Q and T = C G %, then T has no lower bound while any t' G V = C' 

is an upper bound. On the other hand, V has no upper bound while any 

tGT is a lower bound. 

If the set of all upper bounds (lower bounds) of a subset T of a set S contains a least 
element (greatest element) e, then e is called the least upper bound (greatest lower bound) 
of T. 

Let the universal set be Q and consider the rational cut C(r) G X- Since r is the 

least element of C'(r), every s ^ r in Q is an upper bound of C(r) and every t ^ r in Q is 

a lower bound of C'(r). Thus, r is both the least upper bound (l.u.b.) of C(r) and the 
greatest lower bound (g.l.b.) of C'(r). 

Example 5: (a) The set T of Example 4(a) has 3/2 as l.u.b. and -5 as g.l.b. 

(6) Let the universal set be Q. The cut C of Problem 1 has no lower bounds and, 
hence, no g.l.b. Also, it has upper bounds but no l.u.b. since C has no greatest 
element and C' has no least element 

(c) Let the universal set be R. Any cut C € %, being a subset of Q , is a subset 
of R. The cut C(r) then has upper bounds in R and r € R as l.u.b. Also, the 
cut C of Problem 1 has upper bounds in R and y/Z e R as l.u.b. 

Example 5(c) illustrates 

Theorem VI. If S is a non-empty subset of X and if S has an upper bound in X, it has 
a l.u.b. in X • For a proo f ( see Problem 14. 

Similarly, we have 

Theorem VI'. If S is a non-empty subset of X and if S has a lower bound in X> it has a 
g.l.b. in X- 


Thus, the set R of real numbers has the 

Completeness Property. Every non-empty subset of R having a lower bound (upper 
bound) has a greatest lower bound (least upper bound). 

Suppose 0 n = a, where a,6 GR + and n G I + . We call 9 the principal nth root of « 
and write 6 = « 1/n . Then for r = min G Q, there follows « r = 9 m . 

Other properties of R are: 

(1) For each a G R + and each n G /+, there exists a unique 9 G R + such that 9 n = a. 

(2) For real numbers « > 1 and /8, define a s as the l.u.b. of {a r : r G Q, r < /?}. Then a e 
is defined for all a > 0, j8 £ R by setting a* = (l/«)“ s when 0 < a < 1. 
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SUMMARY 


As a partial summary to date, there follows a listing of the basic properties of the 
system R of all real numbers. At the right, in parentheses, is indicated other systems 
N,I,Q for which each property holds. 

Addition 


A t . Closure Law 

A 2 . Commutative Law 
A 3 . Associative Law 

A 4 . Cancellation Law 

A 5 . Additive Identity 

A 6 . Additive Inverses 


Multiplication 
M 4 . Closure Law 
M 2 . Commutative Law 
M 3 . Associative Law 
M 4 . Cancellation Law 


M 5 . Multiplicative Identity 
Mg. Multiplicative Inverses 


Distributive Laws 
D t . 


Density Property 
Archimedean Property 
Completeness Property 


r + sGB, for all r,s G R ( N,I,Q) 

r + s = s + r, for all r,s G R (JV, I, Q) 

r + (a + t) = (r + *) + t, for all r,s,t G R ( N,I,Q) 

If r+t = 8 + t, then r = s for all r,s, t G R. (N, I, Q) 

There exists a unique additive identity element 0 G R 
such that r + 0 = 0 + r = r, for every r G R. (/, Q) 

For each r&R, there exists a unique additive inverse 
—rGR such that r + (— r) = (—r) + r = 0. (/, Q) 

V8&R, for all r,« G R (#, / > Q) 

r-s = 8'r, for all r,« G R (N,/,Q) 

r* (••«) = (»••») »t, for all r,8,t G R ( N,I,Q) 

If m • p = n • p, then m = n, for all m,n G R and 
P^OGR. ( N,I,Q) 

There exists a unique multiplicative identity element 
1 e B such that 1 • r = r • 1 = r for every r G R. (N, I, Q) 

For each r # 0 G R, there exists a unique multipli¬ 
cative inverse r»efi such that r • r~ > = r~ 1 • r = 1. (Q) 

For every r,8,t G R, 
r • (s + t) = r •« + r • t 

(* + t) • r = s • r + t • r ( N,I,Q) 

For each r,8 G R, with r < 8, there exists t G Q 

such that r < t < 8. (Q) 

For each r,s G ft+, with r < a, there exists «G/+ 

such that w • r > s. (qj 

Every non-empty subset of R having a lower bound 
(upper bound) has a greatest lower bound (least 
upper bound). (# ( /) 


Solved Problems 


l. 


Show that the set S consisting of Q , zero, and all s G Q+ such that s 2 < 3 is a cut. 

First, S is a proper subset of Q since 1 G S and 2 G S. Next, let e G S and a G Q with a < c. 
Clearly, sGS when a — 0 and also when c — 0. 


For the remaining case (0 < o < e), a* < ae < c* < 3; then o 2 < 3 and nG5 as required in (i) 
Page 66. Property (ii). Page 66, is satisfied by b = 1 when c ^ 0; then S will be a cut provided 
that for each c > 0 with c 2 < 3 an wG Q+ can always be found such that (c + m) 2 < 3. We can 
simplify matters somewhat by noting that if p/q, where p, g G /+, should be such an m so also is 


1/q. Now q 
2c+1 ^ „ 






. 2c . 


1 


c 2 + 


2c + 1 


thus c + ij < 3 provided 
c 2 , that is, provided (3 - c 2 )g > 2c + 1. Since (3-c 2 ) G Q+ and (2c + 1) G Q + , the 
satisfying the latter inequality is assured by the Archimedean Property of Q + . 
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2. Show that the complement S' of the set S of Problem 1 has no smallest element. 

For any r G S' = {a: a G Q + , o 2 - 3}, we are to show that a positive rational number m 
can always be found such that (r - m) 2 > 3, that is, the choice r will never be the smallest element 
in S’. As in Problem 1, matters will be simplified by seeking an m of the form 1/q where qG IK 

Now ( r—- Y = r 2 - — + \ > r 2 - —; hence, > 3 whenever ^ < r 2 -3, that 

\ qj Q Q 2 9 \ 7 / 9 

is, provided (r 2 - 3)q > 2r. As in Problem 1, the Archimedean Property of Q + assures the existence 
of q G I + satisfying the latter inequality. Thus S’ has no smallest element. 


3. Prove: If C is a cut and r G Q, then (a) D = {r + a : a G C) is a cut and (b) D' = 
{r + a': a'GC'}. 

(а) D # 0 since C # 0; moreover, for any e'GC’, r + c'&D and D ¥= Q. Thus D is a proper 
subset of Q. 

Let b G C. For any sGQ such that s < r+b, we have s-r < b so that s-rGC and 
then s = r + ( 8 - r ) G D as required in (i), Page 66. Also, for b G C there exists an element 
cGC such that c > 6; then r + b, r + cGD and r + e > r + 6 as required in (ii). Page 66. 
Thus D is a cut. 

(б) Let 6' G C’. Then r + b’ G D since b' G C; hence, r + b’ G D’. On the other hand, if 
q' = r + p' G D’ then p’ € C since if it did we would have D n D’ + 0. Thus D' is as 
defined. 


4. Prove: If C is a cut and r G Q + , there exists b G C such that r + b GC. 

From Problem 3, D = {r+ a : a G C} is a cut. Since r > 0, it follows that C C D. Let q G Q 
such that p = r+qGD but not in C. Then q G C but r + qGC. Thus q satisfies the requirement 
of b in the theorem. 


5. Prove: If C G ^ + , then C contains an infinitude of elements of Q + . 

Since C G X+ there exists at least one r G Q + such that r GC. Then for all q G N we have 
r/q G C. Thus, no C G%+ contains only a finite number of positive rational numbers. 


6. Prove: If C = Q~ U {0} U C e % + , then C~ l = Q~ U {0} U C _1 is a positive cut. 

Since C ^ Q + , it follows that C’ K 0; and since C * 0> it follows that C’ ¥= Q + . Let d G C. 
Then (d+l)~ 1 GQ+ and (d + l)- 1 ^" 1 so that (d + lJ-^C -1 and C _1 ^ 0- Also, if e G C 
then for every a G C’ we have e < a and e~ l > a~ l ; hence, c 1 « C 1 and C 1 * Q + - Thus C ls 
a proper subset of Q. 

Let c G C~ l and r G Q+ such that r < c. Then r < c < d l for some d G V and r SC -1 as 
required in (i), Page 66. Also, since there exists sGQ+ such that c<s<d and 

s G C" 1 as required in (ii). Thus C' 1 is a positive cut. 


7. Prove: For each C G%*, its multiplicative inverse is C ' G%K 


Then C-C~ l = io-b : c G C, b G C” 1 !- 
, that 6c < 1. Thus, C * C _1 C CU)- 


Let C = Q- U {0} u C so that C l = Q~ u {0} u C 
Now 6 < d _1 for some d G C’, and so 6d < 1; also, c < 

Let n G C(l) so that n -1 > 1. By Theorem IV there exists c G C such that 
each aGC such that a > c, we have n-o" 1 < n-c" 1 = (c^n- 1 ) -1 ; thus, 1 

w = ae G C• C _1 and CU)CC*C _1 - Hence, C*C _1 = C(D and C*C 
C- 1 G %+. 


c’n-'GC'. For 
= eGC" 1 . Then 
= C(l). By Problem 6, 
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8. When Cej;, show that —C is a cut. 

Note first that —C ¥= 0 since C' ¥= 0. Now let c6C; then —e £ — C, for if it were we would have 
~c < —c' (for some c' G C') so that c' < e, a contradiction. Thus, —C is a proper subset of Q. 
Property (i), Page 66, is immediate. To prove property (ii), let x G —C, i.e., x < —c' for some 
c'GC'. Now x < |(*-c') < -c'. Thus, \{x - c') > x and \(x-e') G -C. 


9. Show that — C of Problem 8 is the additive inverse of C, i.e., C + (— C) = —C + C — C( 0). 

Let c + x G C + (—C), where cGC and x G — C. Now if * < —c' for e' G C', we have 
c + x < c-c' < 0 since e < e'. Then C + (-C) C C(0). Conversely, let j/,zGC(0) with z > y. 
Then, by Theorem III, there exist cGC and c'GC' such that c+(z-y) = c'. Since z-e’ < -c', 
it follows that z-e’ G -C. Thus, y = c + (z-c’) G C + (-C) and C(0)cC+(-C). Hence’, 
C + (-C) = -C + C = C( 0). 


10. Prove the Trichotomy Law: For C € %, one and only one of 


is true. 


c = c( o) cer -c e %+ 


Clearly, neither C(0) nor -C(0) G Suppose now that C * C(0) and C G Since every 

cGC is a negative rational number but C # Q~, there exists c'G Q~ such that c'GC'. Since 
c'<|c'< 0, it follows that 0 < —-Jc' < —o’. Then -£c' G -C and so —C G % + . On the contrary, 
if CG*[ + , every c'GC' is also 6Q+. Then every element of —C is negative and —C G %+. 


11. Prove: For any two cuts Ci,C 2 € we have Ci < C 2 if and only if Ci C C 2 . 

Suppose C! C C 2 . Choose a' G C 2 nC,' and then choose 6 G C 2 such that b > o'. Since -b < -o', 
it follows that —b G -C t . Now — C t is a cut; hence there exists an element c G —C t such that 
e > -b. Then 6 + c > 0 and 6 + cG C 2 +(-C,) = C 2 -C l so that C 2 — Cj G ■*;+. Thus C 2 -C, > C(0) 
and Cj < C 2 . 

For the converse, suppose C, < C 2 . Then C i -C 1 > C(0) and C 2 -C, 6 7C+. Choose d G Q + 
such that d G C 2 - C, and write d = 6 + o where 6 G C 2 and o G —C t . Then -6 < o since d > 0; 
also, since a G -C 1( we may choose an o' G Cj such that o < -o'. Now -6 < -o'; then o' < 6 
to that b G C t and, thus, C 2 $ C,. Next, consider any * G Cj. Then * < 6 so that * G C 2 and, hence, 


12. Prove: If A,BgR with A<B, there exists a rational number C(r) such that 
A < C(r) <B. 

Since A < B, there exist rational numbers r and s with s < r such that r,iGB but not in A. 
Then A — C(s ) < C(r) < B, as required. 


13. Prove: If A,BgR + , there exists a positive integer C(n) such that C(n) • A> B. 
Since this is trivial for A — B, suppose A < B. Let r,s be positive rational numbers such that 
r G A and s G B'; then C(r) < A and C(s) > B. By the Archimedean Property of Q, there exists a 
positive integer n such that nr > s, i.e., C(it) • C(r) > C(s). Then 

. C(w) • A ^ C(n) • C(r) > C(s) > B 


14. Prove: If S is a non-empty subset of % and if S has an upper 

1 11 h in V 


Douna (in AJ, 




l.u.b. in 

Let S = {Cl, c 2 , C 3 , ... > be the subset and C be an upper bound. The union i/ = C,UC 2 uC 3 U 

of the cuts of S is itself a cut G also, since C x Q V, C 2 Q U, C 3 Q U . U is an upper bound 

of <5- But C t C C, C 2 C C, C 3 C C, .hence V C C and V is the l.u.b. of S- 
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Supplementary Problems 

15. (a) Define C(3) and C(-7). Prove that each is a cut. 

(b) Define C'( 3) and C'(-7). 

(c) Locate —10, —5,0,1,4* as G or € each of C( 3), C(—7), C'(3), C'(—7). 

(d) Find 5 rational numbers in C(3) but not in C(—7). 

16. Prove: C(r) C C(s) if and only if r < s. 

17. Prove: If A and B are cuts, then A C B implies A # B. 

18. Prove: Theorem II, Page 66. 

19. Prove: If C is a cut and r G Q+, then C — D = {a + r: a G C}. 

20. Prove: Theorem IV, Page 67. 

21. Let r G Q but not in C G %. Prove: C — C(r). 

22. Prove: (a) C( 2) + C(5) = C(7) (c) C(r) + C(0) = C(r) 

(b) C(2) • C(5) = C(10) (d) C(r) • C(l) = C(r) 

23. Prove: (a) If C6i; + , then —CG%~. 

(b) If C6T, then -C €•*:+. 

24. Prove: —(—€) = C 

25. Prove: (o) If C„ C 2 6 ■*:, then Cj + C 2 and |Ci| • |C 2 | are cuts. 

(b) If C t ^ C(0), then ICjl" 1 is a cut. 

(c) (C -1 ) -1 = C for all C -A C(0). 

26. Prove: (a) If C e % + , then C~ 1 G%+. 

(b) If Cer, then C" 1 £•*[-. 

27. Prove: If r,s G Q with r < *, there exists an irrational number a such that r < a < s. 

Hint. Consider a = r H- 

V2 

28. Prove: If A and B are real numbers with A < B. there exists an irrational number a such that 
A < a < B. 

Hint. Use Problem 12 to prove: If A and B are real numbers with A < B, there exist rational 
numbers t and r such that A < C(t) < C(r) < B. 


29. Prove: Theorem V, Page 69. 


Chapter 8 


The Complex Numbers 

THE SYSTEM C OF COMPLEX NUMBERS 

The system C of complex numbers is the number system of ordinary algebra. It is 
the smallest set in which, for example, the equation x 2 = a can be solved when a is any 
element of R. In our development of the set C, we begin with the product set R x R. The 
binary relation “ = ” requires 

(a, b) = (c, d) if and only if a = c and b = d 
Now each of the resulting equivalence classes contains but a single element. Hence, we 
shall denote a class as (a, b) rather than as [a, 6] and so, hereafter, denote R x R by C. 

ADDITION AND MULTIPLICATION ON C 

Addition and multiplication on C are defined respectively by 

(i) (a,b) + (c,d) = (a + c, b + d) 

(ii) (a, b) • (c, d) = ( ac - bd, ad + be) 

for all (a, b), (c, d) G C. 

The calculations necessary to show that these operations obey A,-A 4> M1-M4, D1-D2 of 
Chapter 7, when restated in terms of C, are routine and will be left for the reader. It is 
easy to verify that (0,0) is the identity element for addition and (1,0) is the identity 
element for multiplication; also, that the additive inverse of (a, b) is -(a, b) = (-a, -6) 

and the multiplicative inverse of (a, 5) ^ (0,0) is (a,b)~ l = - * ~- u ). Hence 

"f" 0 2 Or H” u 2 / 9 

the set of complex numbers have the properties A 5 -A 6 and M 5 -M 6 of Chapter 7 , restated 
in terms of C. 

We shall show in the next section that R C C, and one might expect then that C has 
all of the basic properties of R. But this is false since it is not possible to extend 
(redefine) the order relation “ < ” of R to include all elements of C. 

See Problem 1. 

PROPERTIES OF COMPLEX NUMBERS 

The real numbers are a proper subset of the complex numbers C. For, if in (i) and 
(ii) we take b = d — 0, we see that the first components combine exactly as do the real 
numbers a and c. Thus, the mapping a <-> (a, 0) is an isomorphism of R onto a certain 
subset {(a, 6): a e R, b - 0} of C. 

The elements (a, b) eC in which 6^0, are called imaginary numbers and those 
imaginary numbers (a, b) in which a — 0 are called pure imaginary numbers. 

For each complex number z = (a, 6), we define the complex number z = (575) = (a, -6) 
to be the conjugate of z. Clearly, every real number is its own conjugate while the 
conjugate of each pure imaginary is its negative. 
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There follow easily 

Theorem I. The sum (product) of any complex number and its conjugate is a real number. 
Theorem II. The square of every pure imaginary number is a negative real number. 

See also Problem 2. 

The special role of the complex number (1,0) suggests an investigation of another, 
(0,1). We find ^ ^ i) = ( -y, x) for every (x, y)GC 

and, in particular, 

(V, 0) * (0,1) = (0,1) • (y, 0) = ( 0 , 2 /) 

Moreover, (0, l) 2 = (0,1) *(0,1) = (-1,0)<^.-l in the mapping above so that (0,1) is a 
solution of z 2 = — 1. 

Defining (0,1) as the pure imaginary unit and denoting it by i, we have 
i 2 = —1 

and, for every ( x , y) € C, 

(x,V) = (*> 0) + (0, y) = ( x , 0) + (y, 0) • (0,1) = x + yi 
In this familiar notation, x is called the real part and y is called the imaginary part of the 
complex number. We summarize: 

the negative of z = x + yi is — z = —(x + yi) = —x — yi 
the conjugate of z = x + yi is z = x + yi = x-yi 
for each z = x + yi, z • z = x 2 + y 2 € R 

1 z x y 

for each z^0 + 0*i = 0, z 1 = - = —= = 2 . - 2 . 


SUBTRACTION AND DIVISION ON C 

Subtraction and division on C are defined by 

(iii) z — w = z + (—w), for all z,w GC 

(iv) z-r-w = z • w~ l , for all w ¥= 0, zGC 


TRIGONOMETRIC REPRESENTATION 

The representation of a complex number z by ( x , y) and by x + yi suggests the mapping 
(isomorphism) 

x + yi (x, y) 

of the set C of all complex numbers onto the points {x,y) of the real plane. We may 
therefore speak of the point P(x, y) or of P(x + yi) as best suits our purpose at the moment. 
The use of a single coordinate, surprisingly, often simplifies many otherwise tedious 
computations. One example will be discussed below; another will be outlined briefly in 
Problem 20. 


Consider in Fig. 8-1 the point P (x + yi) + 0 whose 
distance r from O is given by r = yjx 2 + y 2 . If 6 is the 
positive angle which OP makes with the positive 
x-axis, we have 

x — r cos 0, y = r sin 9 

whence z = x + yi — r(cos 8 + i sin 9) 

The right member of this equality is called the trigo¬ 
nometric form (polar form) of z. The non-negative 
real number 



V 

y 

- -+P(x + yi) 


/I . 

0 

Fig.8-1. 
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is called the modulus (absolute value) of z, and 0 is called the angle (amplitude or argument) 
of z. Now 6 satisfies x = r cos 9, y - r sin 0, tan 6 = y/x and any two of these 
determine 6 up to an additive multiple of 2n. Usually we shall choose as 9 the smallest 
positive angle. (Note. When P is at O, we have r = 0 and 9 arbitrary.) 

Example 1: Express (o) 1 + i, (6) —+ i in trigonometric form. 

(a) We have r = Vl + 1 = V2. Since tan 9 = 1 and cos 9 — \l\Jt, we take 9 to be 
the first quadrant angle 45° = v/4. Thus, 1 + i = \/2(coS7r/4 4- i sin ir/4). 

(b ) Here r = V3 + 1 = 2, tan 9 = —l/yfz and cos 9 = —£l/3. Taking 9 to be the 
second quadrant angle 5 jt/ 6, we have 

-yjl + i = 2(cos 5 jt/ 6 + t sin 5v/6) 

It follows that two complex numbers are equal if and only if their absolute values are 
equal and their angles differ by an integral multiple of 2tt, i.e., are congruent modulo 2tt. 

In Problems 3 and 4, we prove 

Theorem III. The absolute value of the product of two complex numbers is the product 
of their absolute values, and the angle of the product is the sum of their 
angles. 

and 

Theorem IV. The absolute value of the quotient of two complex numbers is the quotient 
of their absolute values, and the angle of the quotient is the angle of the 
numerator minus the angle of the denominator. 

Example 2: (o) When z t = 2(cosJir + isin^jr) and z 2 = 4(cos 4- isin Jir), we have 

z t ‘z 2 = 2(cos Jv + i sin £*■) • 4(cos fir + isinfv) 

= 8 (cos jr + tsinir) = —8 
z 2 /z t = 4(cos f tt + isin fa-) = 2(cos {v + isinja) 

= 2(cos + i sin = 2i 

Z\/»2 = £(cos (—£sr) + i sin (—= £(cos 3v/2 + i sin 3 jt/ 2) — -£i 

(6) When z = 2(cos jt/ 6 + isinir/6), 

z 2 = z * z = 4(cos ir/3 + isinv/3) = 2(1 + i\/3) 
and z 3 = z 2 *z = 8(cos Ivr + isin^v) = 8i 

As a consequence of Theorem IV, we have 
Theorem V. If n is a positive integer, 

[r(cos 9 + i sin 0)]" = r n (cos n9 + i sin n9) 

ROOTS 

The equation z” = A = p(cos <j> + i sin <t>) 

where n is a positive integer and A is any complex number, will now be shown to have 
exactly n roots. If z = r( cos 9 + i sin 0) is one of these, we have by Theorem V, 
r"(cos nO + i sin «0) = P (cos <£ + i sin 4>) 

Then r" = p and nO — <j> + 2kn (k, an integer) 


so that 


and 


<j»/n + 2 kir/n 
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The number of distinct roots are the number of non-coterminal angles of the set 
{<f>/n + 2kir/n}. For any positive integer k = nq + m, 0^m<n, it is clear that 
<t>/n + 2kv/n and <j>/n + 2mir/n are coterminal. Thus, there are exactly n distinct roots, 
given by 

P l/n [cos ( <j>/n + 2kir/n) + i sin + 2kir/n)], k — 0,1,2,3, ..., n — 1 
These n roots are coordinates of n equispaced points on the circle, centered at the origin, 
of radius y/\A\- If then z = \f\A\ (cos 6 + i sin 6) is any one of the nth roots of A, the 

remaining roots may be obtained by successively increasing the angle 9 by 2-nln and 

reducing modulo 2tt whenever the angle is greater than 2tt. 

Example 3: (a) One root of z 4 = 1 is r, = 1 = cosO + tsinO. Increasing the angle successively 

by 2v/4 = ir/2, we find r 2 = cos \tt + i sin ^ir, r a = cos w + t sin v, and 
r 4 = cos 3v/2 + i sin 3v/2. Note that had we begun with another root, say 
—1 = cos jr + i sin it , we would obtain cos3ir/2 + i sin 3 t/2, cos 2ir + i sin 2ir = 

cos 0 + i sin 0, and cos + » sin These are, of course, the roots obtained 

above in a different order. 

(b) One of the roots of z 6 = —4y/3 — 4 i = 8(cos 7ir/6 + i sin 7v/6) is 
r t = a/2(cos7v/ 36 + isin 7v/36) 

Increasing the angle successively by 2v/6, we have 

r 2 = i/2 (cos 19>r/36 + i sin 19jr/36) 
r a = V2 (cos 3W36 + isin 3W36) 
r 4 = yf2 (cos 43 v/36 + i sin 43 jt/36) 
r a = \[% (cos 55^/36 + i sin 55ir/36) 
r 8 = y/2 (cos 67v/36 + t sin 67s-/36) 


As a consequence of Theorem V, we have 
Theorem VI. The n nth roots of unity are 

p = cos 2-rr/n + i sin 2ir/n, p 2 , p 3 , p 4 .p"“‘, p" = 1 

PRIMITIVE ROOTS OF UNITY 

An nth root z of 1 is called primitive if and only if z m ¥* 1 when 0 < m < n. Using 
the results of Problem 5, it is easy to show that p and p 5 are primitive sixth roots of 1 
while p 2 , p 3 , p 4 , p 6 are not. This illustrates 

Theorem VII. Let p = cos 2n/n + i sin 2ir/n. If (m,n) = d > 1, then p™ is an n/dth 
root of 1. For a proof, see Problem 6. 

There follows 

Corollary. The primitive nth roots of 1 are those and only those nth roots p, p 2 , p 3 , ..., p" 
of 1 whose exponents are relatively prime to n. 

Example 4: The primitive 12th roots of 1 are 

P = cos 2v/12 + i sin 2a-/12 = £^3 + £* 
p 5 = cos 5v/6 + i sin 5W6 = -£^3 + 
p 7 - cos 7v/6 + i sin 7v/6 -- — ^V3 — 
pH = cosllv/6 + isinllv/6 = %\[% - 
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Solved Problems 


1. Express in the form x + yi: 

(a) 3 - 2 V / Z 1, ( b) 3 + v^i, (c) 5, 

(а) 3 - 2yT^ = 3 - 2» 

(б) 3 + V=4 = 3 + 2i 


^ 3 + 4 i’ 2 — 3i ’ W i3 ‘ 

/ v 5-i _ (5-«)(2 + 3t) _ 13 + 13i 
K l 2-31 ~ (2 -3i)(2 + 3i) “ 13 

(/) i3 = = -* = 0-t 


(c) 5 = 5 + 0 • i 

(d) 1 _ _ 3 ~ 4 « _ 8-4t _ 3 4 

V ’ 3 + 4t (3 + 4i)(3 — 4i) “ "aT" _ 25 25* 


1 + i 


2. Prove: The mapping z +* z, z €E C is an isomorphism of C onto C. 

We are to show that addition and multiplication are preserved under the mapping. This follows 
since for z, = *, + y t i, z 2 = x 2 + y 2 i e C, 

+ ** = (*i + Vl«) + (* 2 + Vtf) = (*i + * 2 ) + <3/i + Vz)i 

= (x, + * 2 ) - (vj + j/ 2 )t = (x, - j/,i) + (x 2 - y 2 i) = z, + z 2 

and *1 * *2 = ( X l X 2 ~ V1V2) + ( X lV2 + x 2Vl)i = i x l x 2 ~ V1V2) ~ ( x iV 2 + x 2Vi)i 

= ( x l - Vli) • ( x 2 - V2 *) = *1 • 2 2 


3. Prove: The absolute value of the product of two complex numbers is the product of 
their absolute values, and the angle of the product is the sum of their angles. 

Let z, = r, (cos «, + i sin #,) and z 2 = r 2 (cos $ 2 + i sin « 2 ). Then 

*1 • *2 = »W(cos «t • cos t 2 - sin tf, • sin « 2 ) + i(sin «, • cos U 2 + sin s 2 • cos »,)] 

= r,r 2 [cos (», + tf 2 ) + isin(tfj + tf 2 )] 


4. Prove: The absolute value of the quotient of two complex numbers is the quotient of 
their absolute values, and the angle of the quotient is the angle of the numerator minus 
the angle of the denominator. 

For the complex numbers z, and z 2 of Problem 3, 

Zi_ _ r,(cos «, + i sin tf,) _ r,(cos «, + i sin *,)(cos e 2 - i sin e 2 ) 
z 2 r 2 (cos e 2 + i sin e 2 ) ~ r 2 (cos e 2 + i sin s 2 )(cos e 2 - i sin e 2 ) 

— ~ [( cos 9 1 cos » 2 + sins, sin t 2 ) + i(sin 0, cos 0 2 - sin e 2 coss,)] 

= y- [cos («, — S 2 ) + i sin (», — fl 2 )] 


5. Find the 6 sixth roots of 1 and show that they include both the square roots and the 
cube roots of 1. 

The sixth roots of 1 are 

p = cos v/3 + i sin tt/3 = \ + £\/3i p 4 = cos 4v/3 + i sin 4v/3 = -1 - ^V3i 

P 2 = cos 2 jt/3 + i sin 2?r/3 = -i + ly/si p s = cos 5v/3 + t sin 5v/3 = £ - 

P 3 = cos IT + i sin w = -1 p« = cos 2v + i sin 2w — 1 

Of these, p 3 = —1 and p® = 1 are square roots of 1 while p 2 = —| + ^2 i, p * = —^ an d 

p6 — 1 are cu b e roo ts of 1. 
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6. Prove: Let P = cos 2-n/n + i sin 2-n/n. If (ra, n) = d > 1, then p m is an n/d th root of 1. 

Let m = m,d and n = n,d. Since p m — cos 2mv/n + 1 sin ‘Lm.TrIn — cos 2m 1 5r/n, + 1 sin 2m l v/n l 
and (p m ) n i = cos 2m ,v + i sin 2m,*- = 1, it follows that p m is an n, = n/d th root of 1. 


Supplementary Problems 

7. Express each of the following in the form x + yi: (a) 2 + y/~ 5, (6) (4 + yf^b ) + (3 — 2yf—5 ), 

(0) (4+ V=5) “ (3-2a/= 5), <d) (3 + 41) • (4 — 51), ( 0 ) (/) §±|j, to) f^fj. W * 4 > 

(t) 1*, (i) (fc) i 8 . 

A ns. (a) 2 + V8*. to) 7-^51, (c) 1 + 3^5*, (d) 32 + 1, (e) 2/13 + 31/13, (/) 4/29 + 191/29, 

to) 4/13 - 191/13, (h) 1 + 0*1, (i) 0 + 1, 0) -1 + 0*1, (fe) 1 + 0 • 1 

8. Write the conjugate of each of the following: (a) 2 + 31, (6) 2 — 31, (c) 5, (d) 21. 

Ans. (a) 2 — 31, (6) 2 + 31, (c) 5, (d) —21 

9. Prove: The conjugate of the conjugate of * is z itself. 

10. Prove: For every 2 # 0 6 C, (z -1 ) = (z) _1 . 

11. Locate all points whose coordinates have the form (a) (a+ 0*1), (6) (0 + 61), where a, b G R. 
Show that any point s and its conjugate are located symmetrically with respect to the x-axis. 

12. Express each of the following in trigonometric form: 

(а) 6 (c) -l-i/lli (e) -6 (g) -3 + ^31 (1) 1/1 

(б) 4-41 (d) -31 (f) y/2 + y/2i (h) -1/(1 +1) 

Ans. (a) 5 cis 0 (d) 3 cis 3W2 to) 2yf& cis 5v/6 

(6) 4\/2 cis 7v/4 (e) 6cisir ( h ) ^2/2 cis 3ir/4 

(c) 2 cis 4 tt/ 3 (/) 2 cis v/4 (1) cis 3ir/2 

where cis 0 = cos 9 + 1 sin 9. 

13. Express each of the following in the form a+bi: 

(а) 5 cis 60° (e) (2 cis 25°) • (3 cis 336°) 

(б) 2 cis 90° (/) (10 cis 100°) • (cis 140°) 

(c) cis 150° to) (6 cis 170°) + (3 cis 50°) 

(d) 2 cis 210° ( h ) (4 cis 20°) (8 cis 80°) 

Ans. (a) 5/2 + 5\/31/2 (c) -£i/3 + |i (e) 6 

(6) 21 (d) -yfZ - i (/) -5 - 5^31 

14. Find the cube roots of: (a) 1, (6) 8, (c) 271, (d) —81, (e) —lyfb — 41. 

Ans. (a) -i ± 1^31, 1; (6)-l±VSt. 2; (c) ± ^ +1 1 , -31; (d) 2i, ±y/s - i; (e) 2 cis 7*718, 
2 cis 19*718, 2 cis 31*718 

15. Find (a) the primitive fifth roots of 1, (6) the primitive eighth roots of 1. 

16. Prove: The sum of the n distinct nth roots of 1 is zero. 


to) -1 + y/3 i 
(h) i-i>/3i 
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17. Use Fig. 8-2 to prove: 

(a) j Zl + z 2 l s* |zj| + \z 2 \ 

(b) |z, - z 2 | |z,| - \z 2 \ 


18. If r is any cube root of a G C, then r, ar, u 2 r, where 
“ ~ ~i + 2 ^^ 1 and “ 2 are the imaginary cube roots 
of 1, are the three cube roots of a. 


19. Describe geometrically the mappings 

(a) z->z (6) z -*zi (c) z-*zi 



20 . In the real plane let K be the circle with center 
at 0 and radius 1 and let A,A 2 A 3 , where j/j) = 

Aj(Zj) = A,!*,- + Vj i), j = 1 , 2 , 3 , be an arbitrary in¬ 
scribed triangle. Denote by P(z) = P{ x + yi) an 
arbitrary (variable) point in the plane. 

(a) Show that the equation of A is z • z = 1. 


(b) Show that P r (x r , y r ) t where 
+ by k 


and y r -__±, divides the line segment 

AjA k in the ratio b:a. Then, since A,,A k and 
/azj + bz k \ 

rr \a+b ) 16 on the line A Ak, verify that 

its equation is z + Xj z k z = Xj + z k . 



Fig. 8-3. 


(o) 

(d) 

(•) 


Verify: The equation of any line parallel to A } A k has the form z + z,z k z = , by showing that 
the midpoints B i and B k of A,Aj and A { A k lie on the line z + zjz k z = |(z, + z j + z k + z,zjz k ). 

Verify: The equation of any line perpendicular to AjA k has the form 
that 0 and the midpoint of AjA k lie on the line z — z } z k z = 0. 


~ZjZ k z = ji by showing 


Use z = z f in z — Zf k z = 
A \A 2 A 3 through A f . Then 
common point H(z l + z 2 + j 


f. 10 obtam the equation z-zf k z = z, - z t z } z k of the altitude of 
eliminate z between the equations of any two altitudes to obtain their 
z 3 ). Show that H also lies on the third altitude. 




Chapter 9 


Groups 


GROUPS 

A non-empty set q on which a binary operation ° is defined is said to form a group 
with respect to this operation provided, for arbitrary a,b, c G Q, the following properties 
hold: 

Pi: (a ° b) o c = a ° (6 ° c) (associative law) 

P 2 : There exists u G Q such that ao u = u°a = a 

(existence of identity element) 

P 3 : For each aG q there exists a -1 G q such that a°a _l = a _1 oa = u 

(existence of inverses) 

Note 1. The reader must not be confused by the use in Pa of a -1 to denote the inverse 
of a under the operation o. The notation is merely borrowed from that previously used 
in connection with multiplication. Whenever the group operation is addition, or* is to be 
interpreted as the additive inverse -a. 

Note 2. The preceding chapters contain many examples of groups for most of which 
the group operation is commutative. We therefore call attention here to the fact that 
the commutative law is not one of the requisite properties listed above. A group is called 
abelian or non-abelian according as the group operation is or is not commutative. For 
the present, however, we shall not make this distinction. 

Example 1: (o) The set I of all integers forms a group with respect to addition; the identity 

element is 0 and the inverse of aG I is -a. Thus, we may hereafter speak of 
the additive group I. On the other hand, I is not a multiplicative group since, 
for example, neither 0 nor 2 has a multiplicative inverse. 

(6) The set A of Example 12(e), Chapter 2, forms a group with respect to °. The 
identity element is a. Find the inverse of each element. 

(c) The set A = {-3, -2, —1,0,1,2, 3} is not a group with respect to addition on 
I although 0 is the identity element, each element of A has an inverse, and addi¬ 
tion is associative. The reason is, of course, that addition is not a binary opera¬ 
tion on A , that is, the set A is not closed with respect to addition. 

(i d) The set A = {w, — — ^ + frfli, <o 2 = - £ - 
of 1 forms a group with respect to multiplication on 
the set of complex numbers C since (i) the product of 
any two elements of the set is an element of the set, 

(ii) the associative law holds in C and, hence, in A, 

(iii) <o 3 is the identity element, and (iv) the inverses 
of <o x , « 2 ,“3 are w 2 > “i> “3 respectively. 

This is also evident from (ii) and the adjacent 
operation table. 


, = 1} of the cube roots 


Table 9-1 

See also Problems 1-2. 
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SIMPLE PROPERTIES OF GROUPS 

The uniqueness of the identity element and of the inverses of the elements of a group 
were established in Theorems III and IV, Chapter 2, Page 20. There follow readily 


Theorem I (Cancellation Law). If a,b,c G Q, then ao b = a°c, (also, boa = cod), implies 

- Q 

For a proof, see Problem 3. 


Theorem II. For a,b GQ, each of the equations 
solution. 


ox = b and y ° a = b has a unique 
For a proof, see Problem 4. 


Theorem III. For every a G q, the inverse of the inverse of a is a, i.e., (a -1 ) -1 = a. 
Theorem IV. For every a,bGQ, (ao&)-‘ = b~ 1 oa~ l . 

Theorem V. For every a,b,.. ,,p,q G q, (aobo • • • opoq)~ l = q~ 1 op~ 1 o ■ ■ ■ ob _1 °a -1 . 

For any a G g and m G I + , we define 

a m = ooooflo • • • oa to m factors 
a 0 = u, the identity element of q 

a~ m = (a -1 )"* = a -1 oa -1 oa -1 ° • • • oa -1 to m factors 

Once again the notation has been borrowed from that used when multiplication is the 
operation. Whenever the operation is addition, o" when w > 0 is to be interpreted as 
na = a + a + a+-'-+a ton terms, a 0 as u, and a" as «(—a) = — a + (-a) + (-a) + • • • + (-a) 
to n terms. Note that na is also shorthand and is not to be considered as the product of 
n G I and a G g. 

In Problem 5 we prove the first part of 

Theorem VI. For any a G q, (i) a m oa n = a m+n and (ii) (a m ) n = o mn , where m,nGl. 

By the order of a group g is meant the number of elements in the set Q. The additive 

group 1 of Example 1(a) is of infinite order; the groups of Example 1(b) and 1(d) are 
finite groups of orders 5 and 3 respectively. 

By the order of an element a G q is meant the least positive integer n, if one exists, 
for which a” = u, the identity element of q. If a ¥* 0 is an element of the additive group I 
then, since na¥= 0 for every positive integer n, the order of a is infinite. The element ^ 
of Example 1(d) is of order 3 since «>, and «> 2 are different from 1 while a>® = 1, the identity 
element. 


SUBGROUPS 

Let g = {a,b,c, ...} be a group with respect to °. Any non-empty subset Q' of Q is 
called a subgroup of q if Q' is itself a group with respect to °. Clearly q’ = {u}, where 

u is the identity element of q, and q itself are subgroups of any group q. They will be 

called improper subgroups; other subgroups of q, if any, will be called proper. We note 
in passing that every subgroup of a group q contains u as its identity element. 

Example 2: (a) A proper subgroup of the multiplicative group Q = {1, — 1, i, — t} is Q' = {1, —1}. 

(Are there others?) 

(6) Consider the multiplicative group Q = (p, p 2 , p 3 , p 4 , p 5 , p 8 = 1} of the sixth roots 

of unity (see Problem 5, Chapter 8, Page 79). It has Q' = {p 3 , p 6 } and 

Q " = {p 2 ,p 4 ,p 8 } as proper subgroups. 

The next two theorems are useful in determining whether a subset of a group q with 
group operation o is a subgroup. 
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Theorem VII. A non-empty subset Q’ of a group q is a subgroup of q if and only if 
(i) q' is closed with respect to °, (ii) q' contains the inverse of each of its 
elements. 


Theorem VIII. A non-empty subset q' of a group q is a subgroup of q if and only if 
for all a, b G q’, a -1 o b G q'. 


There follow 


For a proof, see Problem 6. 


Theorem IX. 
Theorem X. 


Let a be an element of a group q. The set q' = [a n : n G 1} of all integral 
powers of a is a subgroup of q. 

If S is any set of subgroups of a group q, the intersection of these sub¬ 
groups is also a subgroup of q. „ , _ ,. 

For a proof, see Problem 7. 


CYCLIC GROUPS 

A group q is called cyclic if, for some a G q, every x G q is of the form a m , where 
w G /. The element a is then called a generator of q. Clearly, every cyclic group is 
abelian. 

Example 3: (a) The additive group I is cyclic with generator a = 1 since, for every rnG I, 

a m = ma — m. 


(6) The multiplicative group of fifth roots of 1 is cyclic. Unlike the group of (a) 
which has only 1 and —1 as generators, this group may be generated by any 
of its elements except 1. 

(e) The group Q of Example 2(6) is cyclic. Its generators are p and p 5 . 

Examples 3(6) and 3(c) illustrate 

Theorem XI. An element c‘ of a finite cyclic group q of order n is a generator of q if 
and only if (n, t) = 1. 


In Problem 8, we prove 

Theorem XII. Every subgroup of a cyclic group is itself a cyclic group. 


PERMUTATION GROUPS 

The set S„ of the n ! permutations of n symbols was considered in Chapter 2. A review 
of this material shows that S n is a group with respect to the permutation operation °. 
Since ° is not commutative, this is our first example of a non-abelian group. 

It is customary to call the group S n the symmetric group on n symbols and to call any 
subgroup of S n a permutation group on n symbols. 

Example 4: (a) S 4 = {(1), (12), (13), (14), (23), (24), (34), a = (123), a 2 = (132), 0 = (124), 

0 2 = (142), y = (134), y2 = (143), 8 = (234), 8 2 = (243), p = (1234), 
P 2 = (13)(24), p3 = (1432), a = (1243), <r 2 = (14)(23), *3 = (1342), 
r = (1324), r 2 = (12)(34), r 2 = (1423)}. 

(6) The subgroups of S 4 : (i) {(1), (12)}, (ii) {(1), a, a 2 }, (in’s |.(i), (12), (34), (12)(34)}, 
and (iv) A 4 = {(1), a, a 2 0, 0 2 , y, y 2 , 8, 8 2 , p 2 , a 2 , r 2 } are examples of permuta¬ 
tion groups on 4 symbols. (A 4 consists of all even permutations in S 4 and is 
known as the alternating group on 4 symbols.) Which of the above subgroups 
are cyclic? which abelian? List other subgroups of S 4 . 

See Problems 9-10. 


HOMOMORPHISMS 

Let q, with operation °, and q' with operation a , be two groups. By a homomorphism 
of q into q' is meant a mapping 

- 9 ^ 9 ' 
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such that 

(i) every g G q has a unique image g' G (]' 

(ii) if a — a' and b — b', then flo6-»a'ab' 

If, in addition, the mapping satisfies 

(iii) every g' G Q' is an image 

we have a homomorphism of Q onto Cj' and we then call Q' a homomorphic image of Q. 

Example 5: (a) Consider the mapping n — i n of the additive group I onto the multiplicative 

group of the fourth roots of 1. This is a homomorphism since 

and the group operations are preserved. 

(b) Consider the cyclic group Q = {a, a 2 , a 3 , ..., a 12 — u } and its subgroup 
Q' — {a 2 , a 4 , a 6 , ...,o 12 }. It follows readily that the mapping 


is a homomorphism of Q onto Q' while the mapping 
is a homomorphism of Q' into Q. 

See Problem 11. 


In Problem 12 we prove 

Theorem XIII. In any homomorphism between two groups Q and Q', their identity 
elements correspond; and if xGQ and x' G Q' correspond, so also do 
their inverses. 


There follows 

Theorem XIV. The homomorphic image of any cyclic group is cyclic. 


ISOMORPHISMS 

If the mapping of the section above is one-to-one, i.e., 


g ** 9' 

such that (iii) is satisfied, we say that Q and Q’ are isomorphic and call the mapping an 
isomorphism. 


Example 6: 


Show that Q, the additive group //(4), is isomorphic to Q’, the multiplicative group 
of the non-zero elements of //(5). 

Consider the operation tables 


Q 

+ | 0 1 2 3 

0 0 12 3 

112 3 0 

2 2 3 0 1 

3 3 0 1 2 


Q’ 

• | 1 3 4 2 

113 4 2 

3 3 4 2 1 

4 4 2 1 3 

2 2 13 4 


Table 9-2 


Table 9-3 


in which, for convenience, [0], [1],... have been replaced by 0,1, ... . It follows 
readily that the mapping 

Q^Q’ : 0-»l,l-»3, 2-»4, 3->2 
is an isomorphism. For example, l=2 + 3->4«2 = 3, etc. 

Rewrite the operation table for Q' to show that 

Q^Q’ : 0-1, l-»2, 2->4, 3->3 
is another isomorphism of g onto g'. Can you find still another? 
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In Problem 13 we prove the first part of 

Theorem XV. (a) Every cyclic group of infinite order is isomorphic to the additive 
group /. 

(6) Every cyclic group of finite order n is isomorphic to the additive 
group I/(n). 

The most remarkable result of this section is 
Theorem XVI (Cayley). Every finite group of order « is isomorphic to a permutation 
group on n symbols. 

Since the proof, given in Problem 14, consists in showing how to construct an effective 
permutation group, the reader may wish to examine first 


Example 7: 


Consider the adjacent operation table for 
the group g = {g u g 2 , g 3 , g 4 , g s , g 6 } with 
group operation a. 

The elements of any column of the ta¬ 
ble, say the fifth: g 1 °g s = g it 92 ° 05 = g 3 , 
9a ° 05 = 04, 04 ° 9a = 9a, 9a°9a = 0e, 

06° 9a = Oi are the elements of the row 
labels (i.e., the elements of Q) rearranged. 
This permutation will be indicated by 


Oi 
9z 
9a 
04 
9s 
06 


01 

01 

02 

03 

04 

9a 

06 


02 

02 

01 

06 

9a 

04 

9a 


JH _ 04_ 

03 04 

05 06 

01 03 

06 01 
02 03 

04 02 


9a 06 
05 06 

03 04 

04 02 

02 03 

06 01 
01 Oa 


(O' 02 0a 04 05 * 6 \ = (156)(234) Table 9-4 

\05 03 04 02 06 01J 

= Pa 

It follows readily that Q is isomorphic to P = {p lt p 2 , p 3 , p 4 , p 5 , p 8 } where p, = (1), 
p 2 = (12)(36)(45), p 3 = (13)(25)(46), p 4 = (14)<26)(35), p 5 = (156)(234), p 8 = (165)(243) 
under the mapping 

Oi +* Pi (t = 1,2,3, .. .,6) 


COSETS 

Let g be a finite group with group operation o, H be a subgroup of Q, and a be an 
arbitrary element of Q. We define as the right coset Ha of H in Q, generated by a, the 
subset of Q Ha _ [hoa . heH] 

and as the left coset aH of H in g, generated by a, the subset of g 
aH = {aoh: hGH} 


Example 8: The subgroup H = {(1), (12), (34), (12)(34)} and the element a = (1432) of S 4 gen¬ 
erate the right coset 

Ha = {(1) o (1432), (12) o (1432), (34) ° (1432), (12)(34) ° (1432)} 

= ((1432), (243), (142), (24)} 

and the left coset 

aH = {(1432) ° (1), (1432) ° (12), (1432) ° (34), (1432) ° (12)(34)} 

= {(1432), (143), (132), (13)} 

In investigating the properties of cosets, we shall usually limit our attention to right 
cosets and leave for the reader the formulation of the corresponding properties of left 
cosets. First, note that a G Ha since u, the identity element of g, is also the identity 
element of H. If H contains m elements so also does Ha, for Ha contains at most m 
elements and hi°a = h 2 °a for any hi,h 2 GH implies hi = h 2 . Finally, if C r denotes 
the set of all distinct right cosets of H in q, then H G C r since Ha = H when a e H. 

Consider now two right cosets Ha and Hb, a^6, of F in g. Suppose that c is a 
common element of these cosets so that for some h u h 2 G H we have c = hi°a = h 2 °b. 
Then a = hr l °(h 2 ob) = (hf'oh^ob and, since hi l °h 2 G H (Theorem VIII), it follows 
that a G Hb and Ha = Hb. Thus C r consists of mutually disjoint right cosets of q and so is 
a partition of g. We shall call C r a decomposition of q into right cosets with respect to H. 
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Example 9: («) Take Q as the additive group of integers and H as the subgroup of all 

integers divisible by 5. The decomposition of Q into right cosets with re¬ 
spect to H consists of the five residue classes modulo 5, i.e., H — {x : 5 | x}, 
HI = {x: 5 | (x — 1)}, H2 = {* : 5 | (x - 2)}, HZ = {x : 5 | (x - 3)>, and 
H4 = {x: 5 | (at — 4)}. There is no distinction here between right and left cosets 
since Q is an abelian group. 

( b) Let g = S 4 and H = A 4 , the subgroup of all even permutations of S 4 . Then 
there are only two right (left) cosets of g generated by H, namely, A 4 and the 
subset of odd permutations of S 4 . Here again there is no distinction between 
right and left cosets but note that S 4 is not abelian. 

(c) Let g = S 4 and H be the octic group of Problem 9. The distinct right cosets 
generated by H are 

H = {(1), (1234), (13)(24), (1432), (12)(34), (14)(23), (13), (24)} 

H( 12) = {(12), (234), (1324), (143), (34), (1423), (132), (124)} 

H( 23) = {(23), (134), (1243), (142), (1342), (14), (123), (243)} 

and the distinct left cosets generated by H are 

H = {(1), (1234), (13)(24), (1432), (12)(34), (14)(23), (13), (24)} 

(12 )H = {(12), (134), (1423), (243), (34), (1324), (123), (142)} 

(23 )H = {(23), (124), (1342), (143), (1243), (14), (132), (234)} 

Thus, G = HU H( 12) U H( 23) — Hu (12 )H U (23 )H. Here, the decomposition 
of g obtained by using the right and left cosets of H are distinct. 


Let Q be a finite group of order « and H be a subgroup of order m of Q. The number 
of distinct right cosets of H in Q (called the index of H in Q) is r where n = mr\ hence, 

Theorem XVII (Lagrange). The order of each subgroup of a finite group Q is a divisor 
of the order of Q. 

As consequences, we have 

Theorem XVIII. If $ is a finite group of order n, then the order of any element a G Q 
(i.e., the order of the cyclic subgroup generated by a) is a divisor of n. 

and 

Theorem XIX. Every group of prime order is cyclic. 


INVARIANT SUBGROUPS 

A subgroup H of a group Q is called an invariant subgroup (normal subgroup or 
normal divisor) of Q if 

(i) gH = Hg for every g € q 

Since g~* e g whenever g € Q, (i) may be replaced by 

(i') 9 ~ l Hg - H for every g G Q 

Now (i') requires 

(iO for any g eg and any h G H, then g l oh°g GH 

and 

(iz) for any g G Q and each hGH, there exists some kGH such that g~ 1 °kog - h 
or k o g ~ g o h. 

We shall show that (i() implies (i' 2 ). Consider any hGH. By (i,'), (g~ 1 )~ 1 ohog~i = 
gohog- 1 = kGH since g~ x G Q\ then g~ x okog — h as required. We have proved 
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Theorem XX. If H is a subgroup of a group q and if g l °h°g G H for all g G q and 
all h GH, then H is an invariant subgroup of Q. 

Example 10: (a) Every subgroup H of an abelian group Q is an invariant subgroup of Q since 

g a h — h o g, for any g G Q and every hG H. 

(6) Every group Q has at least two invariant subgroups: {w}, since u°g = g °u for 
every g G Q, and Q itself, since for any g,h G Q we have 

g oh = g°h°(g~'°g) = (g°h°g-')°g = k°g and k = g°h°g-iGg 

(e) If H is a subgroup of index 2 of Q [see Example 9(6)] the cosets generated by H 
consist of H and g — H. Hence, H is an invariant subgroup of Q. 

(rf) For Q = {a, a 2 , a 3 , a 12 = it), its subgroups {«, a 2 , a 4 , . . ., a 10 }, {u, a. 3 , a 6 , a 9 }, 

(it, a 4 , a 8 }, and {u, a 6 } are invariant. 

(e) For the octic group (Problem 9), {u,p 2 ,o 2 ,t 2 }, {u,p 2 ,b,e} and {u,p,p 2 ,p 3 } 

are invariant subgroups of order 4 while {u, p 2 } is an invariant subgroup of 
order 2. (Use Table 9-7 to check this.) 

(/) The octic group P is not an invariant subgroup of S 4 since for p = (1234) G P 
and (12) G S 4 , we have (12)" 1 p (12) = (1342) <Z P. 

In Problem 15, we prove 

Theorem XXI. Under any homomorphism of a group q with group operation ° and 
identity element u into a group Q' with group operation a and identity 
element u’, the subset S of all elements of Q which are mapped onto u’ 
is an invariant subgroup of Q. 

The invariant subgroup of Q defined in Theorem XXI is called the kernel of the 
homomorphism. 

In Example 10(6) it was shown that any group Q has {u} and q itself as invariant 
subgroups. They are called improper while other invariant subgroups, if any, of q are 
called proper. A group q having no proper invariant subgroups is called simple. 


QUOTIENT GROUPS 

Let H be an invariant subgroup of a group q with group operation o and denote by 
q/H the set of (distinct) cosets of H in q, i.e., 

q/H = {Ha,Hb,Hc, ...} 

We define the “product” of pairs of these cosets by 

(Ha)(Hb) = {{hi°a)°(h 2 ob) : huhiEH) for all Ha, Hb G q/H 
and, in Problem 16, prove that this operation is well-defined. 

Now q/H is a group with respect to the operation just defined. To prove this, we 
note first that 

(hiod)o(h 2 ob) = hio(aoh 2 )ob = ftio(ft 3 oa)<>& 

= (hi o h 3 ) o (a o 6) = fc 4 °(ao&), h^h^GH 

Then (Ha)(Hb) = H(aob)Gq/H 

and [(Ha)(Hb)}(Hc) = H[(aob)oe] = H[ao{bo C )] = (Ha)[(Hb)(Hc)} 

Next, for u the identity element of q, ( Hu)(Ha) = (Ha)(Hu) = Ha so that Hu = H is the 
identity element of q/H. Finally, since (Ha)(Ha l ) — (Ha l )(Ha) = Hu = H, it follows 
that q/H contains the inverse Ha 1 of each Ha G q/H. 

The group q/H is called the quotient group (factor group) of q by H. 
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Example 11: (a) When g is the octic group of Problem 9 and H = {u, p 2 , 6, e}, then g/H — 
{H,Hp}. This representation of g/H is, of course, not unique. The reader 
will show that g/H = {H, Hp 2 } - {H, Ha 2 } = {H, Hr 2 } = { H,Hp }. 

(6) For the same g and H = {m, p 2 }, we have 

g/H = {H, Hp, Ha 2 , Hb} = {H, Hp 3 , Hr 2 , He} 

The examples above illustrate 

Theorem XXII. If H, of order m, is an invariant subgroup of q, of order n, then the 
quotient group g/H is of order n/m. 

From (Ha)(Hb) = H(a ° 6) G g/H, obtained above, there follows 
Theorem XXIII. If H is an invariant subgroup of a group g, the mapping 

Q-*q/H: g -» Hg 

is a homomorphism of q onto g/H. 

In Problem 17, we prove 

Theorem XXIV. Any quotient group of a cyclic group is cyclic. 

We leave as an exercise the proof of 

Theorem XXV. If H is an invariant subgroup of a group q and if H is also a subgroup 
of a subgroup K of q, then H is an invariant subgroup of K. 

PRODUCT OF SUBGROUPS 

Let H = {hi, h 2 ,..., hr} and K = {b u b 2 , .. .,b p } be subgroups of a group q and 
define the “product” 

HK = {hiobr. hiGH, bj G K) 

In Problems 65-67, the reader is asked to examine such products and, in particular, 
to prove 

Theorem XXVI. If H and K are invariant subgroups of a group q, so also is HK. 
COMPOSITION SERIES 

An invariant subgroup H of a group q is called maximal provided there exists no 
proper invariant subgroup K of q having H as a proper subgroup. 

Example 12: (a) A± of Example 4(6) is a maximal invariant subgroup of S 4 since it is a sub¬ 

group of index 2 in S 4 . Also, {u, p 2 , <r 2 , r 2 } is a maximal invariant subgroup 
of A 4 . (Show this.) 

(6) The cyclic group g = {u,a,a 2 , .. ,,a n } has H = {u,a 2 ,a 4 , .. .,a 10 } and 
K — {u, o 3 , a 6 , a 9 } as maximal invariant subgroups. Also, J = {u, a*, a 8 } is a 
maximal invariant subgroup of H while L = {u, a 6 } is a maximal invariant 
subgroup of both H and K. 

For any group q a sequence of its subgroups 

q, H, J, K, ..U = {u} 

will be called a composition series for q if each element except the first is a maximal 
invariant subgroup of its predecessor. The groups q/H, H/J, J/K, ... are then called 
the quotient groups of the composition series. 

In Problem 18 we prove 

Theorem XXVII. Every finite group has at least one composition series. 
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Example 13: (a) The cyclic group Q = {m, a, a 2 , a 3 , a*} has only one composition series: Q,U = {«}. 

(6) A composition series for Q — S 4 is 

S t , A 4 , {(1), p*. r*}, {(1), p3>, U = {(1)> 

Is every element of the composition series an invariant subgroup of Q ? 

(c) For the cyclic group of Example 12(b), there are three composition series: 
(i) g,H,J, U, (ii) g,K,L, I7, . (iii) g,H,L,U. Is every element of each composi¬ 
tion series an invariant subgroup of g t 

In Problem 19, we illustrate 

Theorem XXVIII (The Jordan-Holder Theorem). For any finite group with distinct 
composition series, all series are of the same length, i.e., have the same 
number of elements. Moreover, the quotient groups for any pair of 
composition series may be put into one-to-one correspondence so that 
corresponding quotient groups are isomorphic. 

Before attempting a proof of Theorem XXVIII (see Problem 23) it will be necessary 
to examine certain relations which exist between the subgroups of a group Q and the 
subgroups of its quotient groups. Let then H, of order r, be an invariant subgroup of a 
group Q of order n and write 


s = q/H = {Ha u Ha 2 , Ha 3 , .. 

.,Ha,}, di G q 

(1) 

where, for convenience, a\ = u. Further, let 



P = {Hb u Hb 2 , Hb 3 , ■ ■ 

■, Hb v } 

(2) 

be any subset of S and denote by 



K = Hbi U Hbi U Hb 3 U 

■ • • U Hb v 

(3) 


the subset of q whose elements are the pr distinct elements (of Q) which belong to the 
cosets in P. 

Suppose now that P is a subgroup of index t of S. Then n = prt and some one of the 
b’ s, say b u is the identity element u of q. It follows that K is a subgroup of index t of q 
and P = K/H since 

(i) P is closed with respect to coset multiplication; hence, K is closed with respect to 
the group operation on q. 

(ii) The associative law holds for q and thus for K. 

(iii) H G P; hence, u G K. 

(iv) P contains the inverse Hb^ 1 of each coset Hbi G P; hence, K contains the inverse of 
each of its elements. 

(v) K is of order pr\ hence K is of index t in q. 

Conversely, suppose K is a subgroup of index t of q which contains H, an invariant 
subgroup of q. Then, by Theorem XXV, H is an invariant subgroup of K and so P - K/H 
is of index t in S = q/H. 

We have proved 

Theorem XXIX. Let H be an invariant subgroup of a finite group q. A set P of the 
cosets of S = q/H is a subgroup of index t of S if and only if K, the 
set of group elements which belong to the cosets in P, is a subgroup 
of index t of q. 

We now assume b t = u in (2) and (3) above and state 
Theorem XXX. Let q be a group of order n — rpt, K be a subgroup of order rp of q, 
and H be an invariant subgroup of order r of both K and q. Then 
K is an invariant subgroup of q if and only if P - K/H is an invariant 
subgroup of S = q/H. For a proofj see Problem 2 o. 
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Theorem XXXI. Let H and K be invariant subgroups of Q with H an invariant sub¬ 
group of K, and let P = K/H and S=g/H. Then the quotient groups 
S/P and Q/K are isomorphic. „ „ _ 

For a proof, see Problem 21. 

Theorem XXXII. If H is a maximal invariant subgroup of a group q then Q/H is simple, 
and conversely. 


Theorem XXXIII. Let H and K be distinct maximal invariant subgroups of a group Q. 
Then 

(a) D = H n K is an invariant subgroup of q, and 

(b) H/D is isomorphic to g/K and K/D is isomorphic to g/H. 

For a proof, see Problem 22. 


Solved Problems 


!• Does the set of residue classes modulo three, form a group with respect to addi¬ 
tion? with respect to multiplication? 

0,1 r° m the additi0n and mu,ti P ,icati °n tables for 7/(3) in which [0], [1], [2] have been replaced by 


+ | 0 1 2 

0 0 12 

112 0 
2 2 0 1 


Table 9-5 


• | 0 1 

0 0 0 

1 0 1 

2 0 2 

Table 9-G 


2 

0 

2 


’Va tKat //(3) f . 0rmS “ gr ° Up With respect to Edition. The identity element is 0 and the inverses 
o ,1,2 are respectively 0,2,1. It is equally clear that while these residue classes do not form a 
group with respect to multiplication, the n 
and each of the elements 1,2 is its o 


2. Do the non-zero residue classes modulo four form a group with respect to multiplication ? 

From Table 5-2 of Example 12, Chapter 5, Page 53, it is clear that these residue classes do not 
form a group with respect to multiplication. 


3. 


Prove: If a,b,c Gg, then a°& = a»c (also, boa = c°a) implies 6 = c 
Consider a°6 = a°c. Operating on the left with a 1 € g, we have a~i°(a°6)- 
Using the associative law, (a-»°a)°5 = (a-»°a)°c; hence, u°b = u°c and so b = 

( b°a)°a 1 — (c°a)°a -1 reduces to b = c. 


° (a°c). 
Similarly, 


4. Prove: When a,b G q, each of the equations ao X = b and y°a = b has a unique 
solution. 

We obtain readily x = a~‘°6 and y = b°a~' as solutions. To prove uniqueness, assume 
x and y to be a second set of solutions. Then oox = oo x ' and yo a = y'o a whence, by Theorem I, 
x ~ x and y — y . 
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5. Prove: For any aeq, a m °a n = a m+n when m,nEl. 

We consider in turn all cases resulting when each of m and n are positive, zero, or negative. 
When m and n are positive, 

m factors n factors m + n factors 

When m = —r and n = s, where r and s are positive integers, 

a moa« = a-roa* = = (o~i ° o" 1 ° • - • ° a~ l ) ° (a ° a o • • • o a) 

fo s_r = a m+n when s — r 

~ ~ = a s ~ r . = a m+n when * < r 

The remaining cases are left for the reader. 


6. Prove: A non-empty subset q' of a group q is a subgroup of Q if and only if, for all 
a, be Q', a-'o be q'. 

Suppose Q’ is a subgroup of Q. If a,b € Q’, then a~ l £ Q’ and, by the Closure Law, so also 

Conversely, suppose Q' is a non-empty subset of Q for which a~ 1 °beg' whenever a,b £ Q'. 
Now a~ l °a = u& Q'. Then u°a~ l = a~ l e Q’ and every element of Q' has an inverse in Q'. 
Finally, for every a,b £ Q', (a~ l )~ l °b = a°b £ Q', and the Closure Law holds. Thus, Q' is a group 
and, hence, a subgroup of Q. 


7. Prove: If S is any set of subgroups of a group q, the intersection of these subgroups 
is also a subgroup of q. 

Let a and b be elements of the intersection and, hence, elements of each of the subgroups which 
make up S. By Theorem VIII, a~ l °b belongs to each subgroup and, hence, to the intersection. 
Thus, the intersection is a subgroup of Q. 


8 . 


Prove: Every subgroup of a cyclic group is itself a cyclic group. 


Let Q' be a subgroup of a cyclic group g whose generator is a. 
positive integer such that a m £ g'. Now every element of g', being an 
k £ I. Writing ^ 


Suppose that m is the least 
element of g, is of the form a k , 


we have ® fc — a m<1+r — (a m ) v °a r 

and, hence, ® r = {a m )~“oa k 

Since both a™ and a k £ g', it follows that a r £ g'. But since r < m, r = 0. Thus k = mq, every 
element of Q' is of the form (a m )«, and g is the cyclic group generated by a m . 


9. 


The subset {u = (1), P , P 2 , P 3 , o 2 , t 2 , b = (13), e = (24)} of S* is a group (see the operation 
table below), called the octic group of a square or the dihedral group. We shall now 
show how this permutation group may be obtained using properties of symmetry of a 


square. 

Consider the square (Fig. 9-1) with vertices denoted by 1,2,3,4; 
locate its center O, the bisectors AOB and COD of its parallel sides, 
and the diagonals 103 and 204. We shall be concerned with all rigid 
motions (rotations in the plane about O and in space about the bisectors 
and diagonals) such that the square will look the same after the motion 
as before. 

Denote by p the counterclockwise rotation of the square about O 
through 90°. Its effect is to carry 1 into 2, 2 into 3, 3 into 4, and 
4 into 1; thus, p = (1234). Now p 2 = p°p = (13)(24) is a rotation 



Fig. 9-1 
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b no 0 H 180 ’ P - (1432) IS a rotation of 270 °. an d P 4 = (D = « is a rotation about O of 360° 
° 2 r -/i.Joo? thr ° Ugh 180 ° about the bisectors AOB and COD give rise respectively to 

* - (14)(23) and r 2 - (12)(34) while the rotations through 180° about the diagonals 103 and 204 give 
rise to e = (24) and b = (13). 

The operation table for this group is 

P p 2 p 3 a 2 t 2 b e 


Table 9-7 


•) 


In forming the table 

(1) fill in the first row and first column and complete the upper left 4X4 block, 

(2) complete the second row, (p»» 2 = (1234) ° (14)(23) = (13) = b, 
and then the third and fourth rows, 

(p2 0 „2 = p o( p o ff 2) = p o b = r 2 , ...) 

(3) complete the second column and then the third and fourth columns, 

(o 2o P 2 = ( ff 2o p )o p = e op = t 2 , ...) 

(4) complete the table, (ff2o x 2- ff ! 0 ( J 2 0p 2) = p 2 ) ) 

10 ' t pe ™ utation grou P on n symbols is called regular if each of its elements except the 
identity moves all n symbols. Find the regular permutation groups on four symbols. 
Using Example 4, the required groups are: 

(p, P 2 , P 3 , p 4 = (1)}, {a, o 2 , o 3 , = (1)}, and {r, t 2 , ,-3, t 4 = (i)} 

11. Prove: The mapping I->I/(n) : m-> [m] is a homomorphism of the additive group I 
onto the additive group //(%) of integers modulo n. 

However- 6 ~ ^ whe " ever m = + r > 0 ~ r < n, it is evident that the mapping is not one-to-one. 

moduTo n. I 6 /ZZ ZTf. r age set . { t°]> [i].;?]. • I* ■ - 1]> of residue classes 


, and every element of this latter set 
.••] + [•] = [*] « 
served and the mapping is a homomorphism of I onto //(«). 


r_i I r 0 i _ r Ii -- - image. Also, if a-* [r] and b -* [s], then 

[r\ + [s] - [t] the residue class modulo n of c = a+6. Thus the group operations are pre- 


12. Prove: In a homomorphism between two groups Q and Q', their identity elements 
correspond, and if x G g and x’ G q' correspond so also do their inverses. 

Denote the identity elements of g and Q’ by u and u’ respectively. Suppose now that , 
or x ¥=u, x -> x\ Then x - u o * ^ v’ a x' = x' = u’ a x’ whence, by the Cancellation 1 
e have the first part of the theorem. 

and x~! -* y’. Then u-x 


whence, by the Cancellation Law, v’ = u’ 


For the second part, suppose : 
o that y’ = (x')-K 


13. Prove: Every cyclic group of infinite order is isomorphic to the additive group I. 

Consider the infinite cyclic group g generated by a and the mapping 


of / into g. Now this mapping is clearly onto; moreover, it is one-to-one since, if for 
s a s and t <h> a‘ with a s = at, then a*-* = u and g would be finite. Finally,’ s + t 
and the mapping is an isomorphism. 


> t we had 
as+t = a s • a ( 
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14. Prove: Every finite group of order n is isomorphic to a permutation group on n 
symbols. 

Let g - (ffi, 02> 9z> ■ ■ ■. On) with group operation ■=> and define 

p . = ( 0i \ = ( 9i 92 93 9n V U =1.2,3. n) 

1 \9i°9jJ \9i°0j 02°0j 02°0} ••• 0n°9iJ 

The elements in the second row of p, are those in the column of the operation table of G labeled g t 
and, hence, are a permutation of the elements of the row labels. Thus P = {pi, P2> Pz> • ■ Pn) ’ s a 
subset of the elements of the symmetric group S n on n symbols. It is left for the reader to show 
that P satisfies the conditions of Theorem VII for a group. Now consider the one-to-one cor¬ 
respondence 


(a) 

If 9t = 0r°0»> then 0t**Pr o P* SO that 


1,2,3, ... 


■U> U.)<=vHX''J=U.) 


and (o) is an isomorphism of Q onto P. Note that P is regular. 


15. Prove: Under any homomorphism of a group Q with group operation o and identity 
element u into a group Q' with group operation <=> and identity element W, the subset 
S of all elements of Q which are mapped onto u' is an invariant subgroup of Q. 

As consequences of Theorem XIII, we have 

(a) u-*u'; hence, S is non-empty. 

(b) if a G S, then a-» -» (u') _1 = u'; hence, a -1 S S. 

(c) if a.bGS, then a" 1 °6 -► v! ■=>«' = hence, a~ l °beS. 

Thus S is a subgroup of Q. 

For arbitrary aG S and g G g, 

g-loaog ^ (g')-l a u'°g' = u' 

so that g~'°a°g G S. Then by Theorem XX, S is an invariant subgroup of g as required. 


16. Prove: The product of cosets 

(Ha)(Hb) = {{hioa)o(h 2 ob) : hi,h 2 GH} for all Ha, Hb G g/H 
where H is an invariant subgroup of Q, is well defined. 

First, we show: For any as,*' e g, Hx' = Hx if and only if x' = v°x for some veff. 
Suppose Hx' = Hx. Then x'GHx requires x' = v°x for some vGH. Conversely, if x -v°x 
with vGH, then Hx' = H(v °x) = (Hv)x = Hx. 

Now let Ha' and Hb' be other representations of Ha and Hb respectively with a' = o°r, 
b' — b°s, and r,s G H. In ( Ha')(Hb') = {[Ai °(a° r)] ° [fc 2 o (6 o g)] : h u h 2 G H} we have, using 
(ij), Page 87, 

[A t o(a°r)] ° [A 2 °(b° s)] = (h^a) ° (r°h 2 ) o (bos) 

= (h 1 °a)oh 3 o(tob) = (h 1 oa)o{h 3 ot)ob 
= o a) ° (A 4 o 6) where h 3 , t, h 4 G H 

Then (Ha')(Hb') = (Ha)(Hb) 

and the product ( Ha)(Hb) is well-defined. 


17. Prove: Any quotient group of a cyclic group Q is cyclic. 

Let H be any (invariant) subgroup of the cyclic group Q = {«. a, a 2 , ..., a r } and consider the 
homomorphism g gjjj. jj a i 

Since every element of g/H has the form Ha- for some € g and H* = ( Ha )* (prove this) it fol¬ 
lows that every element of g/H is a power of b = Ha. Hence, g/H is cyclic. 




CHAP. 9] 


GROUPS 


95 


18. Prove: Every finite group g has at least one composition series. 

(i) Suppose g is simple; then g, U is a composition series. 

(ii) Suppose g is not simple; then there exists an invariant subgroup H ¥> g, U of g. If H is maxi¬ 
mal in g and U is maximal in H, then g,H,U is a composition series. Suppose H is not 
maximal in g but U is maximal in H\ then there exists an invariant subgroup K of g such that 
H is an invariant subgroup of K. If K is maximal in Q and H is maximal in K, there Q, K, H, U 
is a composition series. Now suppose H is maximal in g but U is not maximal in H; then 
there exists an invariant subgroup J of H. If J is maximal in H and U is maximal in J, then 
g,H, J, U is a composition series. Next, suppose that H is not maximal in g and U is not 
maximal in H; then .... Since g is finite, there are only a finite number of subgroups and 
ultimately we must reach a composition series. 


19. Consider two composition series of the cyclic group of order 60: Q = { u,a,a 2 , .. ,,a 59 }: 

Q,H = {u, a 2 , a 4 ,..., a 58 }, J = {u, a 4 , a 3 , ..., a 58 }, K = {u, a 12 , a 24 , a 36 , a 48 }, U = {u} 
and 

Q,M - {u,a 3 ,a 3 , .. .,a 57 }, N = {«,a l5 ,a 30 ,a 45 }, P= {u,a 30 }, U 

The quotient groups are: 

g/H = {H, Ha), HU - {J, Jo*}, J/K = {K, Ka 4 , Ka 8 }, K/U = {17,17a 12 , Ua 2 *, Ua™, Ua 48 > 

and 

g/M = {M, Ma, Ma 2 }, M/N = {N, Na 3 , No®, No 9 , No 12 }, NIP = {P, Pa 13 }, P/U = {U, Ua 30 } 

Then in the one-to-one correspondence: g/H ++ NIP, H/J P/U, J/K g/M, K/U *-* M/N, cor¬ 
responding quotient groups are isomorphic under the mappings: 

H P J*+U K M U<*N 

Ha Pa 15 Ja 2 Ua 30 Ka* Ma Ua 12 Na 3 

Ka 8 <-»• Mo 2 Ua 24 No 6 

Ua 33 ^ No 9 
Ua 43 Na 12 

20. Prove: Let Q be a group of order n = rpt, If be a subgroup of order rp of Q, and H be 
an invariant subgroup of order r of both K and Q. Then K is an invariant subgroup 
of g if and only if P = K/H is an invariant subgroup of S = g/H. 

Let g be an arbitrary element of g and let K = {6„ b 2 , . . .,6^}. 

Suppose P is an invariant subgroup of S. For Hg e S, we have 
0) (Hg)P = P(Hg) 

Thus for any N6j e P, there exists Hb } G P such that 
(ii) (HgXHbJ = (Hb^Hg) 

Moreover, ( Hg)(Hb ,) = (Hbj)(Hg) = ( Hg)(Hb k ) implies Hb t = Hb k . Then 
(»■) Hbi = (Hg~ l )(Hbj)(Hg) = gHHb^g 

(>v) K = Hbi U Hb 2 U ••• U Hb„ = g~ 4 Kg 

and 

(v) gK = Kg 

Thus K is an invariant subgroup of g. 

Conversely, suppose K is an invariant subgroup of g. Then, by simply reversing the steps above, 
we conclude that P is an invariant subgroup of S. 



GROUPS 


[CHAP. 9 


21. Prove: Let H and K be invariant subgroups of Q with H an invariant subgroup of K, 
and let P = K/H and S = q/H. Then the quotient groups S/P and q/K are isomorphic. 


Let Q, K, H have the respective orders n — rpt, rp, r. 
index t in Q and we define 

g/K = (Ac,, Kc 2 , ..., Kc,} , 


Then K is 

<?i e g 


invariant subgroup of 


By Theorem XXX, P is an invariant subgroup of S; then P partitions S into t cosets so that we 
may write S/P = {P(Ha .^ P(Ha ^ . P(Ha it )}, Ha { . G S 

Now the elements of g which make up the subgroup K, when partitioned into cosets with respect 
to H, constitute P. Thus each c k is found in one and only one of the Ha^. Then, after rearranging 
the cosets of S/P when necessary, we may write 

S/P = {P(Hc j), P(Hc 2 ), P(Hc t )} 


The required mapping is g/K S/P : Kc t ** P(He t ) 


22. Prove: Let H and K be distinct maximal invariant subgroups of a group q. Then 
(a) D — H fi K is an invariant subgroup of q and ( b ) H/D is isomorphic to q/K and 
K/D is isomorphic to q/H. 

(a) By Theorem X, D is a subgroup of g. Since H and K are invariant subgroups of g, we have for 
each d€ D and every 

p -1 °d° g 6 H, g~ lo d°g € K and so g~ l °d°g&D 
Then, for every g G g, g~ l Dg — D and D is an invariant subgroup of g. 

(b) By Theorem XXV, D is an invariant subgroup of both H and K. Suppose 

(i) H = Dh x u Dh 2 u ... u Dh n , h t € H 
then, since K(Dh t ) = (KD)ht = Kh { (whyt), 

(ii) KH = Kh x u Kh 2 u ... u Kh n 

By Theorem XXVI, HK - KH is a subgroup of g. Then, since H is a proper subgroup of HK 
and, by hypothesis, is a maximal invariant subgroup of g, it follows that HK = g. 

From (i) and (ii), we have 

H/D = {Dh u Dh 2 , .... Dh n ) and g/K = {Kh u Kh 2 , ..., Kh n } 

Under the one-to-one mapping 

Dhi ** Khi, @= 1 , 2,8 .») 

(Dhi)(Dh } ) = D{hi o h s ) ** K(h,ohj) = (Kh { )(Khj) 
and H/D is isomorphic to g/K. It will be left for the reader to show that K/D and g/H are 
isomorphic. 


23. Prove: For a finite group with distinct composition series, all series are of the same 
length, i.e., have the same number of elements. Moreover the quotient groups for any 
pair of composition series may be put into one-to-one correspondence so that correspond¬ 
ing quotient groups are isomorphic. 

Let (a) g, H u H 2 , H 3 , .... H r = U 

(b) g, K u K 2 , K s , ..., K s = U 

be two distinct composition series of g. Now the theorem is true for any group of order one. Let us 

assume it true for all groups of order less than that of g. We consider two cases: 

(i) Hi = K x . After removing g from (a) and (6), we have remaining two composition series of H x 

for which, by assumption, the theorem holds. Clearly, it will also hold when g is replaced in each. 






CHAP. 9] 


GROUPS 


97 


< u ) K i- Write D - Hi n K t . Since Q/H x (also g/K x ) is simple and, by Theorem XXXIII, 

is isomorphic to KJD (also g/K x is isomorphic to HJD), then KJD (also HJD) is simple. Then 
D is the maximal invariant subgroup of both H x and K x and so Q has the composition series 

^ («') Q, H lt D, D lt D 2 , D 3 . D t = U 

(6') g, K u D, D u D 2 , D 3 . D t = U 

When the quotient groups are written in the order 

Q/Hi, Hi/D, D/D u Di/D 2 , D 2 /D 3 , ..., D t _JD t 
KJD, G/Ki, D/D lt DJD 2 , D 2 /D 3 , .... D t _JD t , 

corresponding quotient groups are isomorphic, that is, g/H t and KJD, HJD and Q/K x , D/D x and 
D/D x , . . . are isomorphic. 

Now by (i) the quotient groups defined by (a) and (o') [also by (6) and (6')] may be put into 
one-to-one correspondence so that corresponding quotient groups are isomorphic. Thus the quotient 
groups defined by (a) and (b) are isomorphic in some order, as required. 


Supplementary Problems 

24. Which of the following sets form a group with respect to the indicated operation: 

(a) S = {*: *£/,*< 0} ; addition 

( b) S = {5* : x £ /} ; addition 

(c) S = {x: x 6 /, x is odd}; multiplication 

(d) The n nth roots of 1; multiplication 

(e) S = {—2, —1,1,2}; multiplication 
(/) S = {1, —1, i, — i} ; multiplication 

( 0) The set of residue classes modulo m; addition 
(h) S = {[o] : [a] £ I/(m), ( a,m ) = 1} ; multiplication 
(*) S = {z : z £ C, |*| = 1} ; multiplication 
Ans. (a), (c), (e), do not. 


25. Show that the non-zero residue classes modulo p form a group with respect to multiplication if and 
only if p is a prime. 

26 ‘ SUbsets of //(13) is a S rou P with res P ect to multiplication: (a) {[11, [121}; 

W {[!], [2], [4], [6], [8], [10], [12]}; (c) {[1], [5], [8], [12]}? Ans. (a), (e). 

27. Consider the rectangular coordinate system in space. Denote by a,b,c 
respectively clockwise rotations through 180° about the X,Y,Z- axis 
and by u its original position. Complete the adjacent table to show 
that {u, a, b, c } is a group, the Klein 4-group. 

28. Prove Theorem III, Page 83. 

Hint, a 1 °x = u has x = a and x = (o -1 ) -1 as solutions. 

29. Prove Theorem IV, Page 83. 

Hint. Consider (a°6)°(&-i o 0 -i) = oo(6°6“ 1 )<>a“ 1 


Prove: Theorem V, Page 83. 
Prove: a~ m = (o m ) _1 , m £ /. 


32. Complete the proof of Theorem VI, Page 83. 
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33. Prove: Theorems IX, XI on Page 84 and Theorem XIV on Page 85. 

34. Prove: Every subgroup (j‘ of a group Q has u, the identity element of <7, as identity element. 

35. List all of the proper subgroups of the additive group 7/(18). 

36. Let Q be a group with respect to ° and a be an arbitrary element of Q. Show that 

77 = {x : x € <7, x ° a = a ° x} 

is a subgroup of Q. 

37. Prove: Every proper subgroup of an abelian group is abelian. State the converse and show by an 
example that it is false. 

38. Prove: The order of is the order of the cyclic subgroup generated by a. 

39. Find the order of each of the elements (o) (123), (6) (1432), (c) (12)(34) of S 4 . 

Ans. (a) 3, (6 ) 4, (c) 2 

40. Verify that the subset A„ of all even permutations in S„ forms a subgroup of S n . Show that each 
element of A n leaves the polynomial of Problem 12, Chapter 2, Page 27, unchanged. 

41. Show that the set (x : x e 7, 5 | x) is a subgroup of the additive group 7. 

42. Form an operation table to discover whether {(1), (12)(34), (13)(24), (14)(23)} is a regular permutation 
group on four symbols. 

43. Determine the subset of S 4 which leaves (o) the element 2 invariant, (6) the elements 2 and 4 invari¬ 
ant, (c) x,x 2 + x 3 x 4 invariant, (d) XiX 2 + x s + x 4 invariant. 

Ans. (a) {(1), (13), (14), (34), (134), (143)} (c) {(1), (12), (34), (12)(34), (13)(24), (14)(23), (1423), (1324)} 

(6) {(1), (13)} ( d) {(1), (12), (34), (12)(34)} 

44. Prove the second part of Theorem XV, Page 86. Hint. Use [m] <-» a m . 

45. Show that the Klein 4-group is isomorphic to the subgroup P = {(1), (12)(34), (13)(24), (14)(23)} of S 4 . 

46. Show that the group of Example 7 is isomorphic to the permutation group 

P = {(1), (12)(35)(46), (14)(25)(36), (13)(26)(45), (156)(243), (165)(234)} 


on six symbols. 


47. Show that the non-zero elements 7/(13) under multiplication form a cyclic group isomorphic to 
the additive group 7/(12). Find all isomorphisms between the two groups. 

48. Prove: The only groups of order 4 are the cyclic group of order 4 and the Klein 4-group. 

Hint. Q = (u, a, b, e} either has an element of order 4 or all of its elements except u have order 2. 
In the latter case, a°6 # a,b,u by the Cancellation Laws. 

49. Let S be a subgroup of a group G and define T = {x: x € <7, Sx = xS}. Prove that T is a sub¬ 
group of G• 

50. Prove: Two right cosets Ha and Hb of a subgroup 77 of a group G are identical if and only if 
ab-i&H. 

51. Prove: a e 776 implies Ha = 776 where 77 is a subgroup of G and 0,6 e g. 

52. List all cosets of the subgroup ((1), (12)(34)} in the octic group. 

53. Form the operation table for the symmetric group S 3 on three symbols. List its proper subgroups 
and obtain right and left cosets for each. Is S 3 simple? 

54. Obtain the symmetric group of Problem 53 using the properties of symmetry of an equilateral triangle. 

55. Obtain the subgroup {u, p 1 , a 2 , r 2 } of S 4 using the symmetry properties of a non-square rectangle. 

56. Obtain the alternating group A 4 of S 4 using the symmetry properties of a regular tetrahedron. 


57. Prove: Theorem XXV, Page 89. 
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58. Show that K - {it, p 2 , a 2 , r 2 } is an invariant subgroup of S 4 . 
homomorphism S 4 -» SJK: x -» Kx. 

Partial answer. U -* K, (12) -► K(\2), (13) -» K (13), .... (24) - 


Obtain SJK and write out in 
K( 13), (34) - K( 12), .... 


full the 


59. Use K - {u, p 2 , o^, t 2 }, an invariant subgroup of S 4 and H = {u.o 2 }, an invariant subgroup of K, 
to show that a proper invariant subgroup of a proper invariant subgroup of a group Q is not neces¬ 


sarily an invariant subgroup of g. 


60. Prove: The additive group 7/(m) is a quotient group of the additive group I. 

“■ £ r ° Ve ; , I£ H is an invariant subgroup of a group Q, the quotient group Q/H is cyclic if the index of 
ti m Q is a prime. 


f (i)> p*» ®*» t 2 -« 

62. Show that the mapping «, fi 2 , y, S 2 - a defines a homomorphism of A 4 onto Q = {u,a,a 2 }. Note 
l“ 2 . P, V 2 , * -*• a 2 

that the subset of A 4 which maps onto the identity element of Q is an invariant subgroup of A 4 . 


63. 


Prove: In a homomorphism of a group Q onto a group Q\ let H be the set of all 
which map into u' € Q’. Then the quotient group of Q/H is isomorphic to Q'. 


elements of 


Q 


64. Set up a homomorphism of the octic group onto {it, a}. 


65. When H - a 2 } and K — {u, fi, p 2 } are subgroups of S 4 , show that HK^KH. Use HK and KH 

to verify: In general, the product of two subgroups of a group g is not a subgroup of Q. 


66 . 


Prove: If H = {h lt h 2 . h r ) 

invariant, then (a) HK = KH, 


and K - (6,, b 2 , 6 p } are subgroups of 

(6) HK is a subgroup of Q. 


group g and 


is 


67. Prove: If H and K are invariant subgroups of Q, so also is HK. 


68. Let g, with group operation ° and identity element «, and Q', with group operation o and unity ele¬ 
ment u , be given groups and form 


J = gxg' = {(g,g')\ geg.g'eg'} 

Define the “product” of pairs of elements (g,g'),(h,h') GJ by 
(*) (g.g’Kh, h') = (g°h,g' a h') 

(а) Show that J is a group under the operation defined in (i). 

(б) Show that S = {(g, it'): g G g) and T = {(u.g '): g' € Q'} are subgroups of J. 

(c) Show that the mappings 


are isomorphisms. 


S~+ Q : (g, it') -» g 


and T -» g' : {u,g’)->g' 


69 ‘ p°^ v f: and (f °f Problem 68, define U = {u} and V = {«'}; also Q = g X U' and Q' = VXQ'. 
(a) g and Q' are invariant subgroups of J. 

{b) J/g is isomorphic to U X Q', and J/Q' is isomorphic to Q X V. 

(c) g and Q' have only (it, it') in common. 

(d) Every element of Q commutes with every element of Q'. 

(e) Eve|y element of J can be expressed uniquely as the product of an element of Q by an element 


7°. Show that S 4 P 2 , v 2 , r 2 }, { u ,a 2 }, U is a composition series of S 4 . Find another in addition to 

that of Example 13(6), Page 90. 
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71. For the cyclic group g of order 36 generated by a: 

(i) Show that a 2 , a 3 , a*, a 6 , a 9 , a 12 , a 18 generate invariant subgroups g 18 , G$, Ge, Gi< Ga> G% respec¬ 

tively of Q. 

(ii) Q,Qia,Q 9 ,G 3 ,U is a composition series of Q. There are six composition series of Q in all; 
list them. 

72. Prove: Theorem XXXII, Page 91. 

73. Write the operation table to show that Q = {l,—\,i,—i,j,—j,k,—k} satisfying i — j — k — 1, 

ij = k = -ji, jk ~ i- -kj, ki — j — —ik forms a group. 

74. Prove: A non-commutative group Q, with group operation ° has at least six elements. 

Hint. (1) Q has at least 3 elements: u, the identity, and two non-commuting elements a and b. 

(2) g has at least 5 elements: u,a,b,a°b,b°a. Suppose it had only 4. Then a°b ¥= b°a 
implies a°b or b°a must equal some one of u,a,b. 

(3) g has at least 6 elements: u,a,b,a°b,b° a, and either o 2 or a°b°a. 

75. Construct the operation tables for each of the non-commutative groups with 6 elements. 

76. Consider S = {u, a, a 2 , a 3 , b, ab, a 2 b, a 3 b) with o 4 = u. Verify: 

(a) If b 2 = u, then either ba = ab or ba = a 3 b. Write the operation tables A g , when ba = ab, 
and D s , when ba = a 3 b, of the resulting groups. 

(b) If b 2 = a or 6 2 = a 3 , the resulting groups are isomorphic to C g) the cyclic group of order 8. 

(c) If b 2 = a 2 , then either ba = ab or ba = a 3 b. Write the operation tables Ag, when ba = ab, and 
Q g , when ba = a 3 b. 

(d) Ag and Ag are isomorphic. 

(e) Z> 8 is isomorphic to the octic group. 

(/) Q g is isomorphic to the (quaternion) group Q of Problem 73. 

(g) Qg has only one composition series. 

77. Obtain another pair of composition series of the group of Problem 19; set up a one-to-one correspond¬ 
ence between the quotient groups and write the mappings under which corresponding quotient groups 
are isomorphic. 
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Rings 

RINGS 

A non-empty set % is said to form a ring with respect to the binary operations addi¬ 
tion (+) and multiplication (•) provided, for arbitrary a,b,c £ %, the following properties 
hold: _ 

(a + b) + c - a + (6 + c) (associative law of addition) 

a, + b = b + a (commutative law of addition) 

There exists z G.% such that a + z = a. 

(existence of an additive identity (zero)) 
For each a € % there exists —a E % such that a + (-a) = z. 

(existence of additive inverses) 
(a • 6) • c = a • (6 • c) (associative law of multiplication) 

a(6 + c) = a • & + a • c 

(b + c)a = b-a + c-a (distributive laws) 


Example 1: Since the properties enumerated above are only a partial list of the properties com¬ 
mon to I,Q,R, and C under ordinary addition and multiplication, it follows that 
these systems are examples of rings. 

Example 2: The set S = {x + yyfl + zy/S> : x,y,z G Q} is a ring with respect to addition and 
multiplication on R. To prove this, we first show that S is closed with respect to 
these operations. We have, for a + by/3 + cy/9, d+ ev^3 + /\^9 G S, 

(a + by/z + cy/9 ) + (d + ey/i + fy/$) = (a + d) + (b + e)^3 + (c + f)y/9 G S 
and 

(a + by/d + cfo)(d + e^3 + fy/ 9) = (ad + Zbf + Zoe) + (ae + bd + 3cf)y/3 
+ (af + be + ed)y/§ G S 

Next, we note that P„P 2 ,P 5 -P 7 hold since S is a subset of the ring R. Finally, 
0 = 0 + 0y/3 + 0V9 satisfies P 3 , and for each * + yy/H + zyf§ G S there exists 
—x — yy /3 — zy/9 G S which satisfies P 4 . Thus S has all of the required properties 

Example 3: (a) The set S = {a, 6} with addition and multiplication defined by the tables 

+ a b . I a b 

b b a blob 

is a ring. 


(b) The set T — {o, b, c, d) with addition and multiplication defined by 
+ 1 • b c d • | a b c 

a a b c d a a a 7 


b b o d 


and 


c c d 


d | d c b 
is a ring. 


d I a d a 


d 


b 

d 
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For these rings the zero element is a and each element is its own additive inverse. 

See also Problems 1-3. 


In Examples 1 and 2 the binary operations on the rings (the ring operations) coincide 
with ordinary addition and multiplication on the various number systems involved; m 
Example 3 the ring operations have no meaning beyond the given tables. In this example 
there can be no confusion in using familiar symbols to denote the ring operations. How¬ 
ever, when there is a possibility of confusion, we shall use ® and © to indicate the ring 
operations. 

Example 4: Consider the set of rational numbers Q. Clearly addition (©) and multiplication (O) 

defined by a ^ b = a . b and aQb = a+b for all a,b G Q 


where + and • are ordinary addition and multiplication on rational numbers, are 
binary operations on Q. Now the fact that P 1( P 2 , and P 5 hold is immediate; also, 
P 3 holds with z = 1. We leave it for the reader to show that P 4 , P 8 . and p 7 do not 
hold and so Q is not a ring with respect to © and O. 


PROPERTIES OF RINGS 

The elementary properties of rings are analogous to those properties of I which do 
not depend upon either the commutative law of multiplication or the existence of a 
multiplicative identity element. We call attention here to some of these properties: 

(i) Every ring is an abelian additive group. 

(ii) There exists a unique additive identity element 2 , (the zero of the ring). 

See Theorem III, Chapter 2, Page 20. 

(iii) Each element has a unique additive inverse, (the negative of that element). 

See Theorem IV, Chapter 2, Page 20. 

(iv) The Cancellation Law for addition holds. 

( V ) -(-a) = a, -(a + b) = (-a) + (-5) for all a,b of the ring. 

(vi) a *z = 2 *a = 2 ( For a P roof > see Problem 4.) 

(vii) a(-b) = -{ab) = (- a)b 

SUBRINGS 

Let % be a ring. A non-empty subset S of the set %, which is itself a ring with 
respect to the binary operations on % is called a subring of When S is a subring of 
a ring %, it is evident that S is a subgroup of the additive group %. 

Example 5: (a) From Example 1 it follows that / is a subring of the rings Q,R,C; that Q is a 

subring of R,C\ and R is a subring of C, 

(b) In Example 2, S is a subring of R. 

(e) In Example 3(6), T 1 = {o}, T 2 = {o, 6} are subrings of T. Why is T a = {a, b, c} 
not a subring of Tt 

The subrings { 2 } and % itself of a ring % are called improper; other subrings, if any, 
of % are called proper. 

We leave for the reader the proof of 

Theorem I. Let % be a ring and S be a proper subset of the set %. Then S is a subring 
of % if and only if 

(a) S is closed with respect to the ring operations. 

(b) for each a e S, we have -a G S. 
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TYPES OF RINGS 

A ring for which multiplication is commutative is called a commutative ring. 

Example 6: The rings of Examples 1,2,3(a) are commutative; the ring of Example 3(6) is non- 
commutative, i.e., 6• c = a but e-b — c. 

A ring having a multiplicative identity element ( unit element or unity) is called a 
ring with identity element or ring with unity. 

Example 7: For each of the rings of Examples 1 and 2, the unity is 1. The unity of the ring 
of Example 3(a) is 6; the ring of Example 3(6) has no unity. 

Let % be a ring with unity u. Then u is its own multiplicative inverse {u~ l — u), but 
other non-zero elements of % may or may not have multiplicative inverses. Multiplicative 
inverses, when they exist, are always unique. 

Example 8: (a) The ring of Problem 1 is a non-commutative ring without unity. 

(6) The ring of Problem 2 is a commutative ring with unity u — h. Here the non¬ 
zero elements b,e,f have no multiplicative inverses; the inverses of e,d,g,h are 
g,d,c,h respectively. 

(c) The ring of Problem 3 has as unity u = (1,0,0,1). (Show this.) Since 
(1,0,1,0)(0,0,0,1) = (0,0,0,0) while (0,0,0,1)(1,0,1,0) = (0,0,1,0), the ring is 
non-commutative. The existence of multiplicative inverses is discussed in 
Problem 5. 


CHARACTERISTIC 

Let % be a ring with zero element z and suppose that there exists a positive integer n 
such that na = a + a + a+ ■ ■ ■ +a = z for every aG% The smallest such positive 
integer « is called the characteristic of %. If no such integer exists, % is said to have 
characteristic zero. 

Example 9: (a) 

(b) 


(c) 


DIVISORS OF ZERO 

Let % be a ring with zero element z. An element a ¥* z of % is called a divisor of zero 
if there exists an element b *= z of % such that a-b = z or b-a = z. 

Example 10: (a) The rings I,Q,R,C have no divisors of zero, that is, in each system a6 = 0 
always implies a = 0 or 6 = 0. 

(6) For the ring of Problem 3, we have seen in Example 8(c) that (1,0,1,0) and 
(0,0,0,1) are divisors of zero. 

(c) The ring of Problem 2 has divisors of zero since 6 • e = a. Find all divisors 
of zero for this ring. 


HOMOMORPHISMS AND ISOMORPHISMS 

A homomorphism (isomorphism) of the additive group of a ring % into (onto) the 
additive group of a ring %' which also preserves the second operation, multiplication, is 
called a homomorphism (isomorphism) of % into (onto) 

Example 11: Consider the ring % = {a, 6, c, d) with addition and multiplication tables 


The rings I,Q,R,C have characteristic zero since for these rings na = %• a. 

In Problem 1 we have a + a = b + b=--- = h + h = a, the zero of the ring, 
and the characteristic of the ring is two. 

The ring of Problem 2 has characteristic four. 
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and the ring %' = { p, q, r, s} with addition and multiplication tables 


The one-to-one mapping 


carries % onto %' (also, %' onto %) and at the same time preserves all binary opera¬ 
tions; for example, 

d = b + c*-*q + s = P 
b = c'd*-*s-p = q, etc. 

Thus, % and %' are isomorphic rings. 


Using the isomorphic rings % and %’ of Example 11, it is easy to verify 
Theorem II. In any isomorphism of a ring % onto a ring 

(а) if * is the zero of % and 2 ' is the zero of we have 2 *-» z\ 

(б) if %<*%’ : a a', then -a -o'. 

(c) if u is the unity of % and u’ is the unity of we have u u’. 

( d) if % is a commutative ring, so also is 

IDEALS 

Let % be a ring with zero element z. A subgroup S of having the property 

r-xGS (X’rGS) for all x G S and rG%, is called a left (right) ideal in %. Clearly, 
{z} and % itself are both left and right ideals in %; they are called improper left (right) 
ideals in %. All other left (right) ideals in if any, are called proper. 

A subgroup 3 of %, which is both a left and right ideal in %, that is, for all x G 3 
and rG% both r-x G 3 and x-r G 3, is called an ideal ( invariant subring) in %. 
Clearly, every left (right) ideal in a commutative ring % is an ideal in %. 

For every ring %, the ideals (z) and % itself are called improper ideals in %; any other 
ideals in % are called proper. A ring having no proper ideals is called a simple ring. 

Example 12: (o) For the ring S of Problem 1, {a,b,c,d} is a proper right ideal in S (examine the 

first four rows of the multiplication table) but not a left ideal (examine the first 
four columns of the same table). The proper ideals in S are (a, c}, {a, e), {a, g), 
and {o,c, e,g). 

(6) In the commutative ring /, the subgroup P of all integral multiples of any 
integer p is an ideal in I. 

(c) For every fixed a,6GQ, the subgroup J = {(or, br, as, bs): r,8 G Q} is a 
left ideal in the ring M of Problem 3 and K = {(or, as, br, bs) : r,s e Q} is a 
right ideal in M since, for every (m,n,p,q) £ M, 

(m,n,p,q)-iar,br,as,bs) - (a{mr + ns), b(mr + ns), a(pr + qs), b(pr+ qs)) £ J 
and 

(or, as, br, bs) • (m, n, p, q) - (o(mr + ps), a(nr + qs), b(mr + ps), b(nr + qs)) £ K 
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Example 12(6) illustrates 

Theorem III. If p is an arbitrary element of 
{p-r: rG%] is an ideal in 


commutative ring then P = 
For a proof, see Problem 9. 


In Example 12(a), each element x of the left ideal {a, c, e, g] has the property that it 
is an element of S for which r-x = a, the zero element of S, for every r G S. This 
illustrates 


Theorem IV. Let % be a ring with zero element z; then 

T ~ {x: xG%, r-x-z (x‘r~z) ior&\\ rG%) 
is a left (right) ideal in %. 


Let P, Q,S,T, ... be any collection of ideals in a ring % and define 3 = 
PnQnSnrn - . Since each ideal of the collection is an abelian additive group, so 
also, by Theorem X, Chapter 9, Page 84, is 3. Moreover, for any x G 3 and r G%, the 
products x-r and r-x belong to each ideal of the collection and, hence, to 3. We have 
proved 


Theorem V. The intersection of any collection of ideals in a ring is an ideal in the ring. 
In Problem 10, we prove 

Theorem VI. In any homomorphism of a ring % onto another ring %' the set S of 
elements of % which are mapped on z', the zero element of is an ideal 
in %. 


Example 13: Consider the ring G = {a+ bi: a,b G I) of Problem 8. 

(a) The set of residue classes modulo 2 of G is H = {[0], [1], [i], [1 + i]}. (Note that 
1 - » = 1 +1 (mod 2).) Prom the operation tables for addition and multiplica¬ 
tion modulo 2, it will be found that H is a commutative ring with unity also H 
has divisors of zero although G does not. 

The mapping G -* H : g -» [g] is a homomorphism in which S = {2g : g S G}, 
an ideal in G, is mapped on [0], the zero element of H. 

{b) The set of residue classes modulo 3 of G is 

K = {[ 0 ]. [ 1 ], [*]. [ 2 ], [ 2 »], [1 + 1 ], [2 + *], [1 + 2 * 1 , [2 + 2 *]} 

It can be shown as in (a) that If is a commutative ring with unity but is without 


PRINCIPAL IDEALS 

Let % be a ring and K be a right ideal in % with the further property 
K — {a • r: r G %, a is some fixed element of K } 

We shall then call K a principal right ideal in % and say that it is generated by the 
element a of K. Principal left ideals and principal ideals are defined analogously. 

Example 14: (a) In the ring S of Problem 1, the subring {a,g} is a principal right ideal in S 

generated by the element g (see the row of the multiplication table opposite g) 
Smce f-g — a for every rGS (see the column of the multiplication table 
headed g), {a,g} is not a principal left ideal and, hence, not a principal ideal in S. 

(b) In the commutative ring S of Problem 2, the ideal (o, 6, e, /> in S is a principal 
ideal and may be thought of as generated by either b or f. 

(c) In the ring S of Problem 1, the right ideal {a,b,e,d} in S is not a principal 
right ideal since it cannot be generated by any one of its elements. 

(d) For any me I, J — {mx: x G /} is a principal ideal in I. 
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In the ring /, consider the principal ideal K generated by the element 12. It is clear 
that K is generated also by the element -12. Since K can be generated by no other of its 
elements, let it be defined as the principal ideal generated by 12. The generator 12 of K, 
besides being an element of K, is also an element of each of the principal ideals: A generated 
by 6, B generated by 4, C generated by 3, D generated by 2, and I itself. Now K C A, 
K C B, K C C, K C D, K C 7; moreover, 12 is not contained in any other principal ideal 
of /. Thus K is the intersection of all principal ideals in / in which 12 is an element. 

It follows readily that any principal ideal in 1 generated by the integer m is contained 
in every principal ideal in 1 generated by a factor of m. In particular, if m is a prime 
the only principal ideal in I which properly contains the principal ideal generated by m is /. 

Every ring % has at least one principal ideal, namely, the null ideal {z} where z is 
the zero element of %. Every ring with unity has at least two principal ideals, namely, {z} 
and the ideal % generated by the unity. 

Let % be a commutative ring. If every ideal in is a principal ideal, we shall call % 
a principal ideal ring. For example, consider any ideal 3 ¥* {0} in the ring of integers I. 
If a ¥* 0 G 3 so also is -a. Then 3 contains positive integers and, since / + is well- 
ordered, contains a least positive integer, say e. For any b G 3, we have by the Division 
Algorithm of Chapter 5, Page 50, 

b — e’q + r, q,rGl, 0 — r<e 

Now e-qG3\ hence, r = 0 and b = e-q. Thus, 3 is a principal ideal in I and we 
have proved 

The ring / is a principal ideal ring. 


PRIME AND MAXIMAL IDEALS 

An ideal 3 in a commutative ring % is said to be a prime ideal if, for arbitrary elements 
r, s of %, the fact that r • s G 3 implies either r G 3 or s G 3. 

Example 15: In the ring I, 

(a) the ideal J = {1r: r€/}, also written as J = (7), is a prime ideal since if 
a • b € J either 7 | a or 7 | 6; hence, either a G J or b G J. 

(b) the ideal K = (14r: r e /> or K = (14) is not a prime ideal since, for example, 
28 = 4 • 7 e K but neither 4 nor 7 is in K. 

Example 15 illustrates 

Theorem VII. In the ring I a proper ideal 3 = (mr: rGl,m¥*0} is a prime ideal if 
and only if m is a prime integer. 

A proper ideal 3 in a commutative ring % is called maximal if there exists no proper 
ideal in % which properly contains 3- 

Example 16: (a) The ideal J of Example 15 is a maximal ideal in I since the only ideal in 1 which 

properly contains J is I itself. 

(6) The ideal K of Example 15 is not a maximal ideal since K is properly contained 
in J which, in turn, is properly contained in I. 


QUOTIENT RINGS 

Since the additive group of a ring % is abelian, all of its subgroups are invariant 
subgroups. Thus, any ideal 3 in the ring is an invariant subgroup of the additive group % 
and the quotient group %!3 — {r + 3- r G %} is the set of all distinct cosets of 3 in %. 
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(Note. The use of r + 3 instead of the familiar rS for a coset is in a sense unnecessary 
since, by definition, r3 = {r°a: nGj} and the operation here is addition. Nevertheless, 
we shall use it.) In the section titled Quotient Groups of Chapter 9, Page 88, addition (+) 
on the cosets (of an additive group) was well-defined by 

(* + 3) + (y + 3) = (x + y) + 3 
We now define multiplication (•) on the cosets by 

(x + 3) ■ (y + 3) = (x • y) + 3 

and establish that it too is well-defined. For this purpose suppose x' = x + s and y' — y + t 
are elements of the additive group % such that x' + 3 and y' + 3 are other representations 
of x + 3 and y + 3 respectively. From 

x'+ 3 = (x + s) + 3 = (x + 3) + (s + 3) =.x + 3 
it follows that s (and similarly t) G 3. Then 

(x' + 3) • (y' + 3) = (x'-y') + 3 = [(x'y) + (x't) + (s-y) + (s-t)] + 3 = (x‘y) + 3 

since x • t, s-y, S’t G 3, and multiplication is well-defined. (We have continued to call 
x + 3 a coset; in ring theory, it is called a residue class of 3 in the ring %.) 

Example 17: Consider the ideal 3 — (3r: r €/} of the ring I and the quotient group 1/3 = 
{3i 1 + 3> 2 + 3)- It is clear that the elements of 1/3 are simply the residue classes 
of //(3) and, thus, constitute a ring with respect to addition and multiplication 
modulo 3. 

Example 17 illustrates 

Theorem VIII. If 3 is an ideal in a ring %, the quotient group %/3 is a ring with respect 
to addition and multiplication of cosets (residue classes) as defined above. 

It is customary to designate this ring by %/3 and to call it the quotient or factor ring 
of % relative to 3. 

From the definition of addition and multiplication of residue classes, it follows that 

(а) The mapping %->%/3:a-+a + 3 is a homomorphism of % onto %/3. 

(б) 3 is the zero element of the ring %/3. 

(c) If % is a commutative ring, so also is %/3. 

(d) If % has a unity element u so also has %/3, namely, u + 3. 

(e) If % is without divisors of zero, <3 i/3 may or may not have divisors of zero. For, while 

(a + 3)-(b + 3) = a-b + 3 = 3 
indicates a • 6 G 3, it does not necessarily imply either a G 3 or b G 3. 


EUCLIDEAN RINGS 

In the next chapter we shall be concerned with various types of rings, for example, 
commutative rings, rings with unity, rings without divisors of zero, commutative rings 
with unity, ... obtained by adding to the basic properties of a ring one or more other 
properties (see Page 71) of R. There are other types of rings and we end this chapter 
with a brief study of one of them. By a Euclidean ring is meant: 

Any commutative ring % having the property that to each x G % a non-negative 
integer 9(x) can be assigned such that 
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(i) 0(x) = 0 if and only if x = z, the zero element of %. 

(ii) 6(x • y) ^ 0(x) when X'y¥°z. 

(iii) For every x G % and y ¥= z G%, 

x-yq + r q,rG%, 0 ^ 0(r) < 9(y) 

Example 18: / is a Euclidean ring. This follows easily by using «(x) = \x\ for every x € /. 

See also Problem 12. 

There follow 

Theorem IX. Every Euclidean ring % is a principal ideal ring. 

Theorem X. Every Euclidean ring has a unity. 


Solved Problems 

1. The set S = {a, b, c, d, e, f, g, h } with addition and multiplication defined by 


is a ring. The complete verification that Pi and Ps-Pi, Page 101, are satisfied is a 
considerable chore but the reader is urged to do a bit of “spot checking”. The zero 
element is a and each element is its own additive inverse. 

2. The set S of Problem 1 with addition and multiplication defined by 


is a ring. What is the zero element? Find the additive inverse of each element. 

3. Prove: The set M - ((a ,b,c,d) : a,b,c,d G Q} with addition and multiplication 
defined by 

(a, b,c,d) + (e,f,g,h) - (a + e,b+f,c + g,d + h) 

(a, b, c, d)(e, f, g, h) — (ae + bg, af + bh, ce + dg, cf + dh) 
for all (a, b, c, d), ( e, f, g,h)GM is a ring. 
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The associative and commutative laws for ring addition are immediate consequences of the asso¬ 
ciative and commutative laws of addition on Q. The zero element of M is (0,0,0,0) and the additive 
inverse of (a, b, c, d) is (—a, —b, —e, —d) G Af. The associative law for ring multiplication is verified 
as follows: 

{(a,b,e,d)(e,f,g,h)](i,j,k,l) 

= ((ae + bg)i + (af + bh)k, (ae + bg)j + (af + bh)l, (ce + dg)i + (cf -I- dh)k, (ce + dg)j + (cf + dh)l) 
= (a(ei + fk) + b(gi + hk), a(ej + fl) + b(gj + hi), c(ei + fk) + d(gi + hk), c(ej + fl) + d(gj + hi)) 

= (a, b, c, d)(ei + fk, ej + fl, gi + hk, gj + hi) 

= (a, b, c, d)[(e, f, g, h)(i, j, k, /)] 

for all (a, b, c, d), (e, f, g, h), (i, j, k, l) G M. 

The computations required to verify the distributive laws will be left for the reader. 


4. Prove: If % is a ring with zero element z, then for all a &%, a-z - z-a = z. 

Since a + z = a, it follows that 

a' a = (a + z)a — (a • a) + z • a 

Now a • a = (a • a) + z; hence, (a • a) + z • a = (a • a) + z. Then, using the Cancellation Law, we 
have z • a = *. Similarly, a • a = o(o + z) = a*a + a*z and a • z = z. 


5. Investigate the possibility of multiplicative inverses of elements of the ring M of 
Problem 3. 

For any element (a, 6, c, d) ¥= (0,0,0,0) of M, set 


(a, b, c, d)(p, q, r, s) = (op + br, aq + bs, ep + dr, cq + ds) = 
the unity of M, and examine the equations 


for solutions p, q, r, 8. 


(i) 


jap + br = 1 
\ep + dr = 0 


(ii) 


j aq + 6s 
\eq + ds 


0 


( 1 , 0 , 0 , 1 ) 


From (i), we have (ad - bc)p = d; thus, provided ad~bc¥=Q, p = . d , and r =--— 

.. —if „ ad —be ad —be 

Similarly, from (ii), we find q = a ^_^ >e and 8 = ^ •_ ^ . We conclude that only those elements 

(a, b, c,d) G M for which ad — be ¥= 0 have multiplicative inverses. 


6. Show that P — {( a,b,—b,a ): a,b G 1} with addition and multiplication defined by 
(a, b, —b, a) + ( c, d, —d, c) = (a + c, b + d, —b — d, a + c) 
and (a, b, —b, a)(c, d, —d, c) = (ac — bd, ad + be, —ad — be, ac — bd) 

is a commutative subring of the non-commutative ring M of Problem 3. 

First, we note that P is a subset of M and that the operations defined on P are precisely those 
defined on M. Now P is closed with respect to these operations; moreover, (—a, —b, b, —a) G P 
whenever (a, b, -b, o) G P. Thus, by Theorem I, P is a subring of M. Finally, for arbitrary 
(a, b, —b, a), ( c, d, — d, c) G P we have 

(a, b, —b, a)(c, d, —d, e) = (c, d, —d, c)(a, b, —b, a) 

and P is a commutative ring. 


7. Consider the mapping (a, b, -b, a)-* a of the ring P of Problem 6 into the ring / of 
integers. 

The reader will show that the mapping carries 

(a, b, -b, a) + (c, d, -d,c) a + e 
and (a, b, —b, a) • (c, d, —d, c) -* ac — bd 

Now the additive groups of P and I are homomorphic. (Why not isomorphic?) However, since 
ac — bd ac generally, the rings P and I are not homomorphic under this mapping. 
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8. A complex number a + bi, where a, b G /, is called a Gaussian integer. (In Problem 26, 
the reader is asked to show that the set G = {a+ bi : a,b £ 1} of all Gaussian integers 
is a ring with respect to ordinary addition and multiplication on C.) Show that the 
ring P of Problem 6 and G are isomorphic. 

Consider the mapping (a, b, —b, a) -* a+ bi of P into G. The mapping is clearly one-to-one; 
moreover since 

(a, b,-b,a) + (e,d, — d,c) = (o + c, b + d, —b — d, a + e) -* 

(a + e) + (b + d)i = (a+bi) + (c + di) 
and (a, b, —b, a)(e, d, —d, e) = (ae — bd, ad + be, —ad — be, ac — bd) -* 

(ae-bd) + (ad + be)i = (a+ bi)(c + di) 

all binary operations are preserved. Thus P and G are isomorphic. 


9. Prove: If v is an arbitrary element of a commutative ring %, then P = {p • r: r G %} 
is an ideal in %. 

We are to prove that P is a subgroup of the additive group % such that (p • r)s £ P for all * G %. 
For all r,8 € %, we have 

(i) p • r + p • 8 = p(r + j)6P, since r + * 6 thus P is closed with respect to addition. 

(ii) — (p • r) = p(—r) S P whenever p'r € P, since —rG% whenever r£?; by Theorem VII, Chap¬ 
ter 9, Page 84, P is a subgroup of the additive gToup. 

(iii) (p • r)s = p(r • s) € P since (r • s) € 

The proof is complete. 

10. Prove: In any homomorphism of a ring % with multiplication denoted by •, into 

another ring %' with multiplication denoted by a, the set S of elements of % which 

are mapped on z', the zero element of is an ideal in %. 

By Theorem XXI, Chapter 9, Page 88, S is a subgroup of hence, for arbitrary a,b,e S S, 

Properties P1-P4, Page 101, hold and ring addition is a binary operation on S. 

Since all elements of S are elements of Properties P5-P7 hold. Now for all a,b 6 S, a* 6 -» 
hence, a • b & S and ring multiplication is a binary operation on S. 

Finally, for every a € S and g € %, we have 

a • g -» 2' ■=> g’ = and g • a -* g’ = *' = z' 

Thus S is an ideal in H. 


11. Prove: The set %/S — (r + J: r £%} of the cosets of an ideal 3 in a ring % is 
itself a ring with respect to addition and multiplication defined by 


and 


(x + 3) + {y + 3) = (x + y) + 3 
(x + 3)' (y + 3) = (x-y) + 3 


for all x + 3, y + 3 G R/3 


Since 3 is an invariant subgroup of the group it follows that %l3 is a group with respect to 
addition. It is clear from the definition of multiplication that closure is assured. There remains 
then to show that the Associative Law and the Distributive Laws hold. We find for all v> + 3< 

x + 3, v + 3^%>3> 

[(w + 3) • (* + 3)] • (y + 3) = {wx + 3)-(y + 3) = (wx)-y + 3 = w • (x • y) + 3 

= (W + 3)-i*-y + 3) = (* + «$>•[(* + 3 )• (v + 3)] • 


(W + 3)- {(X + 3) + (v + 3)] 


(t 0 + 3)-[(* + v) + 3] = [*• (* + »)]+ 3 

(wx + wy) + 3 = (wx + 3) + (wy + 3) 
(w + 3) • (ac + 3) + (w + 3) • (y + 3) 


and, in a similar manner, 

[(* + 3)+ (y + 3)] • (« + 3) 


(X + 3) • (W + 3) + (v + 3) * (W + 3) 
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12. Prove: The ring G — {a+ bi: a,b G /} is a Euclidean ring. 

Define e(a + pi) = a 2 + p 2 for every <* 4- piG G. It is easily verified that the properties (i) and 
(ii), Page 108, for a Euclidean ring hold. (Note also that e(a + hi) is simply the square of the ampli¬ 
tude of a 4- bi and, hence, is defined for all elements of C.) 

For every a G G and y ¥* zG G, compute x • y -1 = * + ti. Now if every 8 +tie. G, the theorem 
would follow readily; however, this is not the case as the reader will show by taking x = 1 +1 and 
y = 2 + 3t. 

Suppose then for a given * and y that 8 +tie G. Let c + diGG be such that |c — *| — and 
|d — t| — and write x = y(e + di) + r. Then 

8{r) — fl[a — y(c + di)] = «[x — y(s + ti) + y(s + ti) — y(c + di)] 

= »[l /{(* -«) + (*- d)*>] - &(V) < «(V) 

Thus (iii) holds and G is a Euclidean ring. 


Supplementary Problems 

13. Show that S = {2a : x G I) with addition and multiplication as defined on / is a ring while 
T = {2a + 1; a G 1} is not. 

14. Verify that S of Problem 2 is a commutative ring with unity = h. 

15. When a,b G / define a©6 = a+6 + l and o06 = o+ 6 + o6. Show that / is a commutative ring 
with respect to © and O . What is the zero of the ring? Does it have a unit element? 

16. Verify that S = {a,b,c,d,e,f,g} with addition and multiplication defined by 


is a ring. What is its unity? its characteristic? Does it have divisors of zero? Is it a simple ring? 
Show that it is isomorphic to the ring //(7). 

17. Show that Q = {(z 1( z 2 , —z 2 , z,): z 1( z 2 G C} with addition and multiplication defined as in Problem 3 

is a non-commutative ring with unity (1,0,0,1). Verify that every element of Q with the exception 
of the zero element (zj = z 2 = 0 + 0i) has an inverse in the form {z t M, H /A, z,/A}, where 

A = |z,|* + |z 2 | 2 , and thus the non-zero elements of Q form a multiplicative group. 

18. Prove: In any ring %, 

(o) —(—a) = a for every aG% 

(b) a(-b) = -( ab) = (-0)6 for all 0,6 G %. 

Hint, (a) a + [(-a) - (-0)] = 0 + z = a 

19. Consider %, the set of all subsets of a given set S and define, for all A,B G %, 

A®B = AuB-AnB AQB = A n B 

Show that % is a commutative ring with unity. 
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20. Show that S = {(a,b,—b,a): a,b G Q] with addition and multiplication defined as in Problem 6 is 
a ring. What is its zero? its unity? Is it a commutative ring? Follow through as in Problem 5 to 
show that every element except (0,0,0,0) has a multiplicative inverse. 


21. Complete the operation tables for the ring % = {a, b, e, d}: 


+ | b e d 
a a b c d 
b b a d c 
c c d a b 
d d e b a 



Is % a commutative ring? Does it have a unity? What is its characteristic? 
Hint, c • b = {b + d)'b; c • c = c • (b + d); etc. 


22. Complete the operation tables for the ring 9 = {a, b, e, d): 


+ a b e d 
a a b c d 
b b a d e 
e e d a b 
d d e b a 


abed 



Is 9 a commutative ring? Does it have a unity? What is its characteristic? Verify that x 2 = x 
for every * G 9. A ring having this property is called a Boolean ring. 


23. Prove: If 9 is a Boolean ring, then (o) it has characteristic two, (6) it is a commutative ring. 
Hint. Consider (x + y) 2 = x + y when y = x and when y * x. 


24. Let ^ be a ring with unity and let a and 6 be elements of %, with multiplicative inverses a~* and 
b~ l respectively. Show that (a • 6) -1 = 6 _1 a -1 . 


25. Show that (o), {o, 6}, (a, 6, c, d} are subrings of the ring S of Problem 1. 


26. Show that G = {a+bi: a,b S /} with respect to addition and multiplication defined on C is a 
subring of the ring C. 


27. Prove Theorem I, Page 102. 


28. (o) Verify that % — {(zj, z z , z 3 , z 4 ): z„ z 2 , z 3 , z 4 G C } with addition and multiplication defined as in 
Problem 3 is a ring with unity (1,0,0,1). Is it a commutative ring? 

(6) Show that the subset S = {(z„ z 2 , —z 2 , z t ): z lt z 2 G C} of % with addition and multiplication 
defined as on 9 is a subring of %. 


29. List all 15 subrings of S of Problem 1. 


30. Prove: Every subring of a ring % is a subgroup of the additive group %. 

31. Prove: A subset S of a ring is a subring of % provided a —b and a • b G S whenever a,b G S. 

32. Verify that the set //(n) of integers modulo n is a commutative ring with unity. When is the ring 
without divisors of zero? What is the characteristic of the ring //(5) ? of the ring //(6)? 
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33. Show that the ring 1/(2) is isomorphic to the ring of Example 3(a). 

34. Prove Theorem II, Page 104. 

35. (a) Show that Afj = {(a, 0, c, d): a,c,d G Q} and M 2 = {(a, 0,0, d): a,d G Q} with addition and 

multiplication defined as in Problem 3 are subrings of M of Problem 3. 

(6) Show that the mapping M, -» M 2 : (x, 0, y, w) -* (x, 0,0, w) 

is a homomorphism. 

(e) Show that the subset {(0,0, y, 0): y G Q) of elements of M l which in ( b) are mapped into 
(0,0,0,0) G M 2 is a proper ideal in M t . 

(d) Find a homomorphism of M l into another of its subrings and, as in (c), obtain another proper 
ideal m M v 

36. Prove: In any homomorphism of a ring % onto a ring having z' as identity element, let 

3 = {*:*€<*, x-*z'} 

Then the ring %/3 is isomorphic to 

Hint. Consider the mapping a + 3 a ’ where a' is the image of aG% in the homomorphism. 

37. Let a, b be commutative elements of a ring ^ of characteristic two. Show that (a + b) 2 = a 2 + b* = 
(a-b) 2 . 

38. Let 7(bea ring with ring operations + and •; let (a, r), ( b, 8) G %X I. Show that 

(i) % X I is closed with respect to addition (®) and multiplication (©) defined by 

(«,r) © (b.s) = (a + b, r + ») 

(a, r) O (6, #) = (a • 6 + r6 + 8 a, r») 

(ii) ^ X / has (z, 0) as zero element and (z, 1) as unity. 

(iii) ^ X / is a ring with respect to © and O. 

(iv) % X {0} is an ideal in % X /. 

(v) The mapping ^ % X (0) : x (*,0) is an isomorphism. 

39. Prove Theorem IX, Page 108. Hint. For any ideal 3 * {*} in %, select the least e(y), say 8 (b), 
for all non-zero elements y G 3. For every xG 3 write * = b • q + r with q,r G 3 and either r - z 
or 8 (r) < e(b). 


40. Prove Theorem X, Page 108. Hint. Suppose % is generated by a; then a = a • s = s • a for some 
8G<H. For any bG%, b = g* a = q(a- *) = 6 • g, etc. 
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Integral Domains, Division Rings, Fields 

INTEGRAL DOMAINS 

A commutative ring 3), with unity and having no divisors of zero, is called an integral 
domain. 

Example 1: (a) The rings I,Q,R, and C are integral domains. 

(6) The rings of Problems 1 and 2, Chapter 10, Page 108, are not integral domains; 
in each, for example, / • e = a, the zero element of the ring. 

(c) The set S = {r + «\/l7: r,s € /} with addition and multiplication defined as 
on R is an integral domain. That S is closed with respect to addition and multi¬ 
plication is shown by 

(o + 6Vl7) + (e + dy/VJ) = (« + c) + (6+ d)Vl7 G S 
(a + by/vi )(c 4- dyfvi) = (ac + llbd) + (ad + bc)\/vi e S 

for all (a + by/vi ), (e + d\Zl7) € S. Since S is a subset of R, S is without 
divisors of zero; also, the associative laws, commutative laws and distributive 
laws hold. The zero element of S is 0 68 and every a + 6VT? S S has an 
additive inverse -a — by/Tl € S. Thus, S is an integral domain. 

(d) The ring S = {a,b,e,d,e,f,g,h} with addition and multiplication defined by 



Table 11-1 


is an integral domain. Note that the non-zero elements of S form an abelian 
multiplicative group. We shall see later that this is a common property of all 
finite integral domains. 

A word of caution is necessary here. The term integral domain is used by some to 
denote any ring without divisors of zero and by others to denote any commutative ring with¬ 
out divisors of zero. ^ Problem j 

The Cancellation Law for Addition holds in every integral domain 3) since every element 
of 3) has an additive inverse. In Problem 2 we show that the Cancellation Law for Multi¬ 
plication also holds in 3) in spite of the fact that the non-zero elements of 3) do not neces¬ 
sarily have multiplicative inverses. As a result “having no divisors of zero” in the 
definition of an integral domain may be replaced by “for which the Cancellation Law for 
Multiplication holds”. 

In Problem 3, we prove 

Theorem I. Let 3) be an integral domain and 3 be an ideal in 3). Then 3)13 is an integral. 
domain if and only if 3 is a prime ideal in 3i. 
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UNIT, ASSOCIATE, DIVISOR 

Let 3) be an integral domain. An element v of 3) having a multiplicative inverse in 3) is 
called a unit (regular element) of 3). An element b of .0 is called an associate of a G 3) 
if b — v’a where v is some unit of 3). 


Example 2: (a) The only units of I are ±1; the only associates of a & I are —a. 


(b) Consider the integral domain 3> — (r + s\/17: r,s £ /}. Now a = 
is a unit if and only if there exists x + yy/vf £ 3> such that 
(o + byfvi )(* + yy/l7) = (ax + Ylby) + (bx + ay)^Y7 — 1 

+ Ylby = 1 _ . o 

„ we obtain x = -3——rrr 
t ay = 0 a 2 - 176 2 


s + bVri e a> 


1 + Oy/Vi 


From 


and 


x + yy/vi £ 3), i.e., x,y £ I, if and only if o 2 — 17b 2 = ±1; hence, a is a unit 
if and only if a 2 - 17b 2 = ±1. Thus, ±1, 4 ± Vl7, -4 ± y/l7 are units in 3) 
while 2 — Vl7 and -9 — 2\/l7 = (2 — Vl7 )(4 + ) are associates in 3). 


(e) Every non-zero element of 1/(7) = {0,1,2,3,4,5,6} 
Ms l(mod 7), 2 • 4 = l(mod7), etc. 


is a unit of 1/(7) since 
See Problem 4. 


An element a of 3) is a divisor of b G 3> provided there exists an element c of 3) 
such that b = a • c. Every non-zero element b of 3) has as divisors its associates in 3) and 
the units of 3). These divisors are called trivial {improper)-, all other divisors, if any, are 
called non-trivial (proper). A non-zero, non-unit element b of 3>, having only trivial divisors, 
is called a prime (irreducible element) of 3). An element b of 3), having non-trivial divisors, 
is called a reducible element of 3). For example, 15 has non-trivial divisors over I but not 
over Q; 7 is a prime in I but not in Q. 

See Problem 5. 

There follows 

Theorem II. If 3) is an integral domain which is also a Euclidean ring then, for a ^ z, 
b ¥* z of 3>, 

9(a • 6) = 6(a) if and only if b is a unit of 3) 


SUBDOMAINS 

A subset 3)' of an integral domain 3), which is itself an integral domain with respect 
to the ring operations of 3), is called a subdomain of 3). It will be left for the reader to 
show that z and u, the zero and unity elements of 3), are also the zero and unity elements 
of any subdomain of 3). 

One of the more interesting subdomains of an integral domain 3) (see Problem 6) is 
3)' = { nu : n e /} 

where nu has the same meaning as in Chapter 10. For, if 3>" be any other subdomain of 3), 
then 3)' is a subdomain of 3>" and hence, in the sense of inclusion, 3>' is the least subdomain 
in 3). Thus, 

Theorem III. If 3) is an integral domain, the subset 3)' = {nu : n € /} is its least sub- 
domain. 

By the characteristic of an integral domain 3> we shall mean the characteristic, as 
defined in Chapter 10, of the ring 3>. The integral domains of Example 1(a) are then of 
characteristic zero while that of Example 1(d) has characteristic two. In Problem 7, 
we prove 

Theorem IV. The characteristic of an integral domain is either zero or a prime. 
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Let 3) be an integral domain having <2)' as its least subdomain and consider the mapping 
/ -*• 3 )': n-> nu 

If 3) is of characteristic zero, the mapping is an isomorphism of 7 onto 3)'\ hence, in 3) we 
may always replace 3>’ by 7. If 3) is of characteristic p (a prime), the mapping 
I/(p) -* 3 )': [n] -» nu 

is an isomorphism of I/(p) onto 3)'. 


ORDERED INTEGRAL DOMAINS 

An integral domain 3) which contains a subset 2 5 + having the properties: 

(i) 3>+ is closed with respect to addition and multiplication as defined on 3), 


(ii) 


for every a € 3), one and only one of 

a-z a G 3) + 


holds, 


—a G 3) + 


is called an ordered integral domain. The elements of 3> + are called the positive elements 
of D; all other non-zero elements of 3) are called negative elements of 3). 


Example 3: The integral domains of Example 1(a) are ordered integral domains. In each, the 
set 3)+ consists of the positive elements as defined in the chapter in which the 
domain was first considered. 


Let 3) be an ordered integral domain and, for all a,b G 3>, define 
a>b when a — bG3) + 


and a < b if and only if b > a 

Since a>z means a G 3) + and a<z means —a G 3 ) + , it follows that, if a¥=z, then 
a 2 G 3) + . In particular, uG3) + . 

Suppose now that 3 > is an ordered integral domain with 3) + well-ordered; then u is the 
least element of 3) + . For, should there exist a G 3 ) + with z<a<u, then z< a 2 <au = a. 
Now a 2 G 3)+ so that 3) + has no least element, a contradiction. 


In Problem 8, we prove 

Theorem V. If 3) is an ordered integral domain with 3 )+ well-ordered, then 

(i) <0 + = [pu: p G 7 + } 

(ii) 3) = {mu: m G 7} 

Moreover, the representation of any a G 3) as a = mu is unique. 

There follow 

Theorem VI. Two ordered integral domains 3)i and 3) 2 , whose respective sets of positive 
elements <2>, + and <Z> 2 + are well-ordered, are isomorphic. 

and 

Theorem VII. Apart from notation, the ring of integers 7 is the only ordered integral 
domain whose set of positive elements is well-ordered. 


DIVISION ALGORITHM 

Let 3) be an integral domain and suppose d G 3> is a common divisor of the non-zero 
elements a,b G 3). We call d a greatest common divisor of a and b provided for any 
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other common divisor d' G 3), we have d’ \ d. When 3) is also a Euclidean ring, d' \ d is 
equivalent to 0(d) > 0(d’). 

(To show that this definition conforms with that of the greatest common divisor of two 
integers as given in Chapter 5, suppose ±d are the greatest common divisors of a,b G I 
and let d’ be any other common divisor. Since for n G I, 0(n) = \n\, it follows that 
6(d) = 0(-d) while 6(d) > 0(d’).) 

We state for an integral domain which is also a Euclidean ring 

The Division Algorithm. Let a¥= z and b be in 3>, an integral domain which is also 
a Euclidean ring. There exist unique q,r G 3) such that 

b - q-a + r, 0^ 0(r) < 0(q) 

See Problem 5, Chapter 5. 

UNIQUE FACTORIZATION 

In Chapter 5 it was shown that every integer a > 1 can be expressed uniquely (except 
for order of the factors) as a product of positive primes. Suppose a = pi • • p 3 is such 

a factorization. Then 

-a = -pi'Pi'Pz = Pi(-Pi)P3 = Pi’P 2 (-p 3 ) = (-1 )pi-p 2 -p 9 = (- 1 )Pi‘(- 1 )P 2 -(- 1 )Ps 
and this factorization in primes can be considered unique up to the use of unit elements 
as factors. We may then restate the unique factorization theorem for integers as follows: 
Any non-zero, non-unit element of I can be expressed uniquely (up to the order of 
factors and the use of unit elements as factors) as the product of prime elements of I. 
In this form we shall show later that the unique factorization theorem holds in any 
integral domain which is also a Euclidean ring. 

In Problem 9, we prove 

Theorem VIII. Let J and K, each distinct from {«}, be principal ideals in an integral do¬ 
main 3). Then J — K if and only if their generators are associate ele¬ 
ments in 3). 

In Problem 10, we prove 

Theorem IX. Let a,b,p G 3>, an integral domain which is also a principal ideal ring, 
such that p | a'b. Then if p is a prime element in 3), p | a or p \ b. 

A proof that the unique factorization theorem holds in an integral domain which is also 
a Euclidean ring (sometimes called a Euclidean domain) is given in Problem 11. 

As a consequence of Theorem IX, we have 

Theorem X. In an integral domain 3) in which the unique factorization theorem holds, 
every prime element in 3> generates a prime ideal. 


DIVISION RINGS 

A ring of, whose non-zero elements form a multiplicative group, is called a division ring 
(skew field or sfield). Every division ring has a unity and each of its non-zero elements has 
a multiplicative inverse. Multiplication, however, is not necessarily commutative. 

Example 4: (o) The rings Q, R, and C are division rings. Since multiplication is commutative, 

they are examples of commutative division rings. 

(b) The ring Q of Problem 17, Chapter 10, is a non-commutative division ring. 

(c) The ring I is not a division ring. (Why?) 
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Let 3) be an integral domain having a finite number of elements. 


we have 


{b'x: xG3)} = 3) 


For any b ¥= z G 3), 


since otherwise b would be a divisor of zero. Thus, b-x — u for some x G 3> and b has 
a multiplicative inverse in 3). We have proved 

Theorem XI. Every integral domain, having a finite number of elements, is a commuta¬ 


tive division ring. 


We now prove 

Theorem XII. Every division ring is a simple ring. 

For, suppose 3 ¥* {z} is an ideal of a division ring ef. If a ¥* z G 3, we have a -1 G of 
and a-a~ l — uG3. Then for every bG<~f, b‘U = bG3; hence, 3 = <J- 


FIELDS 

A ring f whose non-zero elements form an abelian multiplicative group is called a field. 

Example 5: (a) The rings, Q, R, and C are fields. 

(6) The ring S of Example 1(d) is a field. 

(c) The ring M of Problem 3, Chapter 10, Page 108, is not a field. 

Every field is an integral domain; hence, from Theorem IV, Page 115, there follows 
Theorem XIII. The characteristic of a field is either zero or is a prime. 

Since every commutative division ring is a field, we have (see Theorem XI) 

Theorem XIV. Every integral domain having a finite number of elements is a field. 

Any subset f' of a field f, which is itself a field, is called a sub field of f. 

Example 6: Q is a subfield of the fields R and C; also, R is a subfield of C. 

See also Problem 12. 

Let f be a field of characteristic zero. Its least subdomain, I, is not a subfield. How¬ 
ever, for each b ¥* 0, b G /, we have b~ l Gf; hence, for all a,b GI with 6^0, it follows 
that a* 6 _1 = a/b Gf. Thus, Q is the least subfield of f. Let f be a field of character¬ 
istic p, a prime. Then I/(p), the least subdomain of f is the least subfield of f. 

A field f which has no proper subfield f is called a prime field. Thus Q is the prime 
field of characteristic zero and I/{p) is the prime field of characteristic p, where p is a 
prime. 

We state without proof 

Theorem XV. Let f be a prime field. If f has characteristic zero, it is isomorphic to Q; 
If f has characteristic p, a prime, it is isomorphic to I/(p). 

In Problem 13, we prove 

Theorem XVI. Let 3) be an integral domain and 3 an ideal in 3). Then 3) 13 is a field 
if and only if 3 is a maximal ideal in 3>. 
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Solved Problems 

1. Prove: The ring I/(m) is an integral domain if and only if to is a prime. 

Suppose to is a prime p. If [r] and [s] are elements of //(p) such that [r] • [s] = [0], then 

r-s = 0(mod p) and r = 0(mod p) or s = 0(modp). Hence, [r] = [0] or [s] = [0]; and//(p), having 

no divisors of zero, is an integral domain. 

Suppose m is not a prime, that is, suppose m = m 1 -m 2 with 1 < m u m 2 < m. Since [ m] — 

[mj] • [m 2 ] = [0] while neither [wij] = 0 nor [m 2 ] = [0], it is evident that I/(m) has divisors of zero 

and, hence, is not an integral domain. 


2. Prove: For every integral domain the Cancellation Law of Multiplication 
If a• c = b-c and c ¥* z, then a — b 

holds. 

From a • e = 6 • c we have o • c — b-c = (a — 6) • c = z. Now D has no divisors of zero; hence, 
a — b = z and a = b as required. 


3. Prove: Let 2) be an integral domain and 3 be an ideal in 2). Then 2>/3 is an integral 
domain if and only if 3 is a prime ideal in 2). 

The case 3 — 2) is trivial; we consider 3 c 2). 

Suppose 3 is a prime ideal in 2). Since 2) is a commutative ring with unity, so also is 2)1 3- 
To show that 2)13 > s without divisors of zero, assume a + 3 , b + 3 e 2)13 such that 
(a + 3 )(b + 3 ) = a-b + 3 = 3 

Now a-b S 3 and, by definition of a prime ideal, either a € 3 or b S 3- Thus either a + 3 or 
b + 3 's the zero element in 2 ) 13 '- and 2)1 3 - being without divisors of zero, is an integral domain. 

Conversely, suppose 2)13 is an integral domain. Let a ¥• z and b ¥* z of 2) be such that 
a-b & 3- From 

3 = a-b + 3 = (a + 3 ){b + 3 ) 

it follows that a + 3 = 3 or b + 3 = 3- Thus 0 * b € 3 implies either a e 3 or b S 3 , and 3 
is a prime ideal in 2). 

Note. Although 2) is free of divisors of zero, this property has not been used in the above proof. 
Thus, in the theorem, “Let 2) be an integral domain” may be replaced by “Let % be a commutative 
ring with unity”. 

4. Let t be some positive integer which is not a perfect square and consider the integral 
domain 2) — {r + sv^t: r,s G/}. For each p = r + sy't € 2), define p = r — s\ft and 
the norm of P as N( P ) = p • j». From Example 2(6), Page 115, we infer that p = a + by/t 
is a unit of 2) if and only if N( P ) = ±1. Show that- for a = a + by/t G 2) and 
p = c + dy/t G 2), N{a-p) = N(a)-N{p). 

We ha ve a- p = (ac + bdt) + (ad+ bc)yft and a-p = ( ac+ bdt) — (ad + bc)^/t. Then N{a - p) = 
(a • p)(a - P) = (ac + bdt)* - (ad + bc)H = (a? - bH)(c? - cPt) = N( a ) - N(p), as required. 


5. In the integral domain 2) = {r + s^/17: r,s G /}, verify: (o) 9 — 2\/l7 is a prime, 
(6) y = 15 + 7\/l7 is reducible. 

(а) Suppose a,p G 2) such that a - p = 9 — 2y/l7. By Problem 4, 

N( a -p) = N( a )-N(p ) = N( 9-2V17) = 13 

Since 13 is a prime integer, it divides either N(a) or N(p); hence either p or a is a unit of 2), and 
9 — 2\/l7 is a prime. 

(б) Suppose a-a + byfri, p = c + dy/vj G 2) such that « • p = y = 15 + 7\/l7; then N(a)-N(p) = 
—608. From N(a ) = a 2 — 17 b 2 = 19 and N(p) = c 2 — 17 cP = —32, we obtain a = 6 — y/vf and 
p = 11 + 3\/l7. Since a and p are neither units of 2) nor associates of y, 15 + 7\/l7 is reducible. 
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6. Show that 3’ — { nu : n G /}, where u is the unity of an integral domain 2>, is a sub- 
domain of < 2 ). 

For every ru, su G 3', we have 

ru + su = (r + s)m G 3' and (ru)(su) = rsu G .[£>' 

Hence 3)' is closed with respect to the ring operations on 3. Also, 

0m = z G 3' and 1 m - u G 3' 

and for each ru G 3' there exists an additive inverse —ru G 3'- Finally, (ru)(su) = z implies 
ru — z or su — z. Thus 3' is an integral domain, a subdomain of 3- 


7. Prove: The characteristic of an integral domain 3 is either zero or a prime. 

From Examples 1(a) and 1(d) it is evident that there exist integral domains of characteristic zero 
and integral domains of characteristic m > 0. 

Suppose 3 has characteristic m = r » 1 •wi 2 with 1 < m 1 ,m 2 < m. Then mu = (wjM)(m 2 M) = z and 
either m x u — z or m 2 u = z, a contradiction. Thus m is a prime. 


8. Prove: If X> is an ordered integral domain such that 3+ is well-ordered, then 
(i) 3 + = {pu: pS/ + ) (ii) 3 - {mu: m G /} 

Moreover, the representation of any a G 3 as a = mu is unique. 

Since u&3 + it follows by the closure property that 2u = u + u& 3+ and, by induction, that 
pu&3+ for all pG/+. Denote by E the set of all elements of 3 + not included in the set 
{pw : p G /+} and by e the least element of E. Now u G E so that e > u and, hence, e — u G 3 + 
but e — uGE. (Why?) Then e — u = p 1 u, for some p l G/ + , and e = u+p x u = (1 +p x )u = p 2 u, 
where p 2 G /+. But this is a contradiction; hence E = 0, and (i) is established. 

Suppose at 3 but aG,Z> + ; then either a = z or — #GJ + . If a = z, then a = 0 m. 

If -«£^+ then, by (i), — a = mu for some mG/+ so that a = (— m)u, and (ii) is established. 

Clearly, if for any a G 3 we have both a — ru and a = su, where r,s G /, then z = a — a = 
ru — su — (r — s)u and r = s. Thus the representation of each a G 3 as a = mu is unique. 


9. Prove: Let J and K, each distinct from {«}, be principal ideals in an integral domain 3. 
Then J — K if and only if their generators are associate elements in 3. 

Let the generators of J and K be a and 6 respectively. 

First, suppose a and 6 are associates and 6 = a • v, where v is a unit in 3. For any cGK 
there exists some s G 3 such that 

c = b’8 = (a* v)s = a(v • s) = a • where s' € 3 

Then cGJ and KqJ. Now b — a’v implies a = b • v~ »; thus, by repeating the argument with 
any d G J, we have J Q K. Hence, J = K as required. 

Conversely, suppose J = K. Then for some s,t G 3 we have a — b • s and b = a • t. Now 
a = b’s = («•()* = a(t*s) 
so that a — a(t'8) = a(u — t • s) = z 

where u is the unity and z is the zero element in 3- Since a ¥= z, by hypothesis, we have u — t • s = z 
so that t • 8 = u and s is a unit in 3. Thus a and b are associate elements in 3, as required. 

10. Prove: Let a,b,p G 3, an integral domain which is also a principal ideal ring, and 
suppose p | a • b. Then if p is a prime element in 3, p \ a or p \ b. 

If either a or 6 is a unit or if a or 6 (or both) is an associate of p, the theorem is trivial. 
Suppose the contrary and, moreover, suppose p / a. Denote by $ the ideal in 3 which is the inter- 
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section of all ideals in 3) which contain both p and a. Since 3 is a principal ideal, suppose it is 
generated by c G 3 so that p = e • x for some x G 3). Then either (i) x is a unit in 3) or 
(ii) e is a unit in 3). 

(i) Suppose x is a unit in 3)\ then, by Theorem VIII, p and its associate c generate the same 
principal ideal 3- Since a € Jj, we must have 

a = c • g — p'h for some g, h G 3) 

But then p | a, a contradiction; hence, x is not a unit. 

(ii) Suppose c is a unit; then c*c _l = uG t ^ and $= 3). Now there exist >,f G 3) such that 
m = p •» + t • o, where u is the unity of 3). Then 

b = u'b = (p • s)6 + (t • a)b — p(s • b) + t(a • b) 
and, since p | a • b, we have p ] 6 as required. 


11. Prove: The unique factorization theorem holds in any integral domain 3) which is also 
a Euclidean ring. 

We are to prove that every non-zero, non-unit element of 3) can be expressed uniquely (up to 
the order of the factors and the appearance of unit elements as factors) as the product of prime 
elements of 3). 

Suppose a ¥* 0 € 3) for which 6(a) = 1. Write a = 6 • c with b not a unit; then c is a unit and 
a is a prime element in 3), since otherwise 

6(a) = 6(b • e) > e(b) by Theorem II, Page 115 

Next, let us assume the theorem holds for all b G 3) for which 6(b) < m and consider c G 3) 
for which 6(e) = m. Now if c is a prime element in 3), the theorem holds for e. Suppose, on the 
contrary, that e is not a prime element and write c = d- e where both d and e are proper divisors of c. 
By Theorem II, we have 6(d) < m and 6(e) < m. By hypothesis, the unique factorization theorem 
holds for both d and e so that we have, say, 

e = d • e = Pi • p 2 ' P3 ‘ ‘ ‘ P s 

Since this factorization of c arises from the choice d,e of proper divisors, it may not be unique. 

Suppose that for another choice of proper divisors we obtained e = g, • g 2 • g 3 • • • q t . Consider 
the prime factor p t of c. By Theorem IX, Page 117, p, | g t or Pi | (g 2 • g 3 ' • • 9 t ); if Pi /g„ then p, | g 2 
or Pi | (flf® • • • 9 t ); if . Suppose p x | q } . Then q s = /• p x where / is a unit in 3) since, other¬ 

wise, q i would not be a prime element in 3). Repeating the argument on 
Pz • P3 ■' • P* = / -1 * 7i • ' 7j-i * «j+i ' * • q t 

we find, say, p 2 \ q k so that q k = g'p 2 with g a unit in 3>. Continuing in this fashion, we ultimately 
find that, apart from the order of the factors and the appearance of unit elements, the factorization of 
c is unique. This completes the proof of the theorem by induction on m (see Problem 27, Chapter 3, 
Page 37). 

12. Prove: S = {* + yy/Z + zy/9 : x,y,z G Q} is a subfield of R. 

From Example 2, Chapter 10, Page 101, S is a subring of the ring R. Since the commutative 
law holds in R and 1 = 1 + Oy/3 + 0\/9 is the multiplicative identity, it is necessary only to verify 
that for x + yy/H + zy/$ ¥= 0 G S, the multiplicative inverse x ^ xy y/3 + — - ^ xz yf§, 

where D = x 3 + 3y 3 + 9z 3 — 9 xyz, is in S. 


13. Prove: Let 3) be an integral domain and 3 an ideal in 3>. Then 3>/3 is a field if and 
only if 3 is a maximal ideal in 3). 


First, suppose J is a maximal ideal in 3)\ then 3 c 3) and (see Problem 3) 3)13 is a commuta¬ 
tive ring with unity. To prove 3)13 is a field, we must show that every non-zero element has a 
multiplicative inverse. 

For any q G 3) —Jj, consider the subset 


S = 

of 3). For any y G 3) and a + q • 
a • y G JJ; similarly, y(a + q • x) G S. 
Thus any r G 3) may be written as 
we find 


{a + q-x: a G J, * G 3)} 

G S, we have (o + q • x)y = a • y + q(x • y) G S since 
Then S is an ideal in 3) and, since 3 cS, we have S — 3>. 
— a + q • e, where e G 3). Suppose for u, the unity of 3), 

a + q-f, fG3) 





122 


INTEGRAL DOMAINS, DIVISION RINGS, FIELDS 


[CHAP. 11 


From u+3 = (a +3) + (q +J) • (/ +J) = ( q+3)‘(f+3) 

it follows that / +3 is the multiplicative inverse of q +3- Since q is an arbitrary element of 3) ~3> 
the ring of cosets 3)/3 is a field. 

Conversely, suppose 3)13 is a field. We shall assume 3 not maximal in 3) and obtain a con¬ 
tradiction. Let then J be an ideal in 3) such that 3 <Z J C 3). 

For any 3) and any p£ J ~3< define (p+3)~ l '( a +3) = « + 3> then 
a+3 = (P+3)-(»+S> 

Now a — p • s G 3 and, since Jc/, a — p m 8 € J. But p € J; hence a & J, and J — 3), a con¬ 
tradiction of J C 3). Thus 3 is maximal in 3). 

The note in Problem 3, Page 119, also applies here. 


Supplementary Problems 

14. Enumerate the properties of a set necessary to define an integral domain. 

15. Which of the following sets are integral domains, assuming addition and multiplication defined as on R: 

(а) {2a + 1: a € /} (e) {a+by/ 3: a,b S /} 

(б) {2a: a €/> (/) {r + «V3: r,s G Q} 

(c) {ay/ 3: a €/} (g) {a + by/Z + + d^lO : a,6,c,d€/} 

(d) {n/3: r€Q} 

16. For the set G of Gaussian integers (see Problem 8, Chapter 10, Page 110), verify: 

(а) G is an integral domain. 

(б) a = a+bi is a unit if and only if N(a) = a 2 + b 2 = 1. 

(c) The only units are ±1, ±i. 

17. Define S = {(a 1( a 2 , a 3 , a 4 ): a { e R) with addition and multiplication defined respectively by 

(a 1 ,a 2 ,a 3 ,a 4 ) + (6j, b 2 , b a , b 4 ) = (a t +b u a 2 +b 2 , a 3 +b s , a 4 +b 4 ) 

and (oj, a 2 , a 3 , a 4 )(6 lf 6 2 ,6 3 ,6 4 ) = (aj • b u a 2 • b 2 , b 3 , a 4 • b 4 ) 

Show that S is not an integral domain. 

18. In the integral domain 3) of Example 2(6), Page 115, verify: 

(а) 33 ± Sy/VJ and —33 ± 8\/T7 are units. 

(б) 48 - Uy/rf and 379 - 92’/l7 are associates of 5 + Ay/V7. 

(e) 8 = 2 • 2 • 2 = 2(8 + 2-\/i7)(—8 + 2y/l7) = (6 + y/l7)(5 — \/i7), in which each factor is a prime; 
hence, unique factorization in primes is not a property of 3). 

19. Prove: The relation of association is an equivalence relation. 

20. Prove: If, for a G 3), N{a) is a prime integer then a is a prime element of 3). 

21. Prove: A ring % having the property that for each a ¥* z,b e % there exists r€i( such that 

a'r — b is a division ring. 

22. Let 3)' = {[0], [5]} and 3)" = {[0], [2], [4], [6], [8]} be subsets of 3> - I/{ 10). Show: 

(а) 3)' and 3)" are subdomains of 3). 

(б) 3)' and 1/(2) are isomorphic; also, 3)" and //(5) are isomorphic. 

(c) Every t€ 3) can be written uniquely as a = a' + a" where a' S 3)' and a" € 3)". 

(d) For a, 6 e 3), with a = a' + a" and 6 = 6' + 6", a + b = (a' + 6') + (a" + 6") and a • 6 = a' • 6' + 
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23. Prove: Theorem II. 

Hint. If 6 is a unit then «(a) = 8[fe _l (a • 6)] — e(a • 6). If 6 is not a unit, consider a = q(a*b) + r, 
where either r - z or e(r) < e(a • 6), for a¥* z& I Z). 

24. Prove: The set S of all units of an integral domain is a multiplicative group. 

25. Let 3) be an integral domain of characteristic p and 3)' — {* p : x G 3)}. Prove: 

(a) (a ± b)P = ap ± 6 p ( 6) The mapping 3) -> 3)' : x -> *p is an isomorphism. 

26. Show that for all a ¥= z,b of any division ring, the equation ax — b has a solution. 

27. The set SI = {(qj + q 2 i + qtf + q 4 k): q t , q 2 , q 3 , q 4 G R} of quaternions with addition and multiplica¬ 
tion defined by 

(“i + ®2* + a 3 i + a 4 k) + (5, + b 2 i + b 3 j + b 4 k) = (a, + 6,) + (a 2 + b 2 )i + (a 3 + b 3 )j + (a 4 + fe 4 )fe 

and 

(“i + «2* + “3J + °4*) " + b 2 i + ftgj + b 4 k) = (a l b 1 — a 2 b 2 — a 3 b 3 — a 4 b 4 ) + (a t b 2 + a^b^ + a 3 b 4 — a 4 b 3 )i 

+ («i*3 — a 2 b 4 + 0361 + a 4 b 2 )j + («j 64 + a 2 b 3 — a 3 6 2 + a 4 bj)k 
is to be proved a non-commutative division ring. Verify: 

(a) The subsets Q 2 = {(q 4 + q 2 i + 0 j + Ok)}, Q 3 = {(q, + Oi + qtf + Ok)}, and Q 4 = {(<?„ + Ot + 0 j + q 4 k)} 

of SI combine as does the set C of complex numbers; thus, i 2 = j 2 = k 2 = —1. 

(b) q u q 2 , q 3 , q 4 commute with i, j, k. 

(c) ij = k, jk = i, ki = j 

(d) ji = -k, kj = -i, ik = -j 

(e) With Q as defined in Problem 17, Chapter 10, Page 111, the mapping 

Q 3.: (<7i + q 3 i, «s + 94*. “93 + 94*. 9i — 92*) •* (9i + 92* + 931 + 94*) 
is an isomorphism. 

(/) §.isa non-commutative division ring (see Example 4, Page 117). 

28. Prove: A field is a commutative ring whose non-zero elements have multiplicative inverses. 

29. Show that P = {(a, b, — b, a) : a,b G R} with addition and multiplication defined by 

(a,b,-b,a) + (c,d,-d,c) = (a + e, b + d, -b-d, 0 + c) 
and (a, 6, —6, a)(c, d, —d, e) — (ae — fed, ad 4- be, —ad — be, ae — bd) 

is a field. Show that P is isomorphic to C, the field of complex numbers. 

30. (a) Show that {«+6\/3: a,6GQ} and {a + by/2 + ey/E + dVlO : a,6,c,dGQ} are subfields of R. 

(fe) Show that {a + 6\/2 : a, 6 G Q} is not a subfield of R. 

31. Prove: S = {a + 6r: a,b G R, r = —^(1 + l/3i)} is a subfield of C. 

Hint. The multiplicative inverse of a + br ^ 0 G S is ^- a ~ 6 , -5- \ , ,» r G S. 

a 2 — afe + 6 2 a 2 — afe + 6 2 

32. (a) Show that the subsets S = {[0], [5], [10]} and T = {[0], [3], [6], [9], [12]} of the ring //(15) are 

integral domains with respect to the binary operations on //(15). 

(fe) Show that S is isomorphic to 1/(3) and, hence, is a field of characteristic 3. 

(e) Show that T is a field of characteristic 5. 

33. Consider the ideals A = {2g : g G G), B = {5g :gGG}, E = {7g -.gGG}, and F = {(1 + i)g : g G G} 
of G, the ring of Gaussian integers, (a) Show that G/A = G/(2) and G/B = G/( 5) are not integral 
domains, (fe) Show that G/E is a field of characteristic 7 and G/F is a field of characteristic 2. 

34. Prove: A field contains no proper ideals. 

35. Show that Problems 3 and 13 imply: If $ is a maximal ideal in a commutative ring % with unity, 
then $ is a prime ideal in %. 




Chapter 12 


Polynomials 


INTRODUCTION 


A considerable part of elementary algebra is concerned with certain types of functions, 
for example, 1+2 * + 3x , * + 5-4*° + 3*» 


called polynomials in x. The coefficients in these examples are integers although it is not 
necessary that they always be. In elementary calculus, the range of values of x (domain of 
definition of the function) is R. In algebra, the range is C; for instance, the values of x 
1 v/2 

for which 1 + 2x + 3x 2 is 0 are - ^ ± -y- i. 

In the light of Chapter 2, any polynomial in x can be thought of as a mapping of a 
set S (range of x) onto a set T (range of values of the polynomial). Consider, for example, 
the polynomial 1 + y/2 x - 3a: 2 . If S = I, then T C R and the same is true if S = Q or 
S = R; if S = C, then TcC. 

As in previous chapters, equality implies “identical with"; thus two polynomials in x 
are equal if they have identical form. For example, a + bx = c + dx if and only if a = c 
and b = d. (Note that a + bx = c + dx is never to be considered here as an equation in x.) 

It has been our experience that the images of each value of x G S are the same ele¬ 
ments of T when a{x) = p{x) and, in general, are distinct elements of T when <*(x) ¥= fi(x). 
However, as will be seen from Example 1 below, this familiar state of affairs is somehow 
dependent upon the range of x. 

Example 1: Consider the polynomials a(x) = [l]x and P(%) — [l]* 5 , where [1] €//(5), and 
suppose the range of a; to be the field 1/(5) = {[0], [1], [2], [3], [4]}. Clearly, a(x) and 
P(x) differ in form (are not equal polynomials); yet, as is easily verified, their images 
for each x G 1/(5 ) are identical. 

Example 1 suggests that in our study of polynomials we begin by considering them 
as forms. 


POLYNOMIAL FORMS 

Let % be a. ring and let x, called an indeterminate, be any symbol not found in %. 
By a polynomial in x over % will be meant any expression of the form 

«(x) = a 0 x° + aix 1 + 02 X 2 + • • • = 2 akXk > 

in which only a finite number of the a’s are different from z, the zero element of %. 
Two polynomials in x over %, a(x) defined above, and 

p(x) = box 0 + biX 1 + & 2 X 2 + • • • = 2 b k x k , biG% 
will be called equal, «(x) = 0(x), provided a k = b k for all values of k. 
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In any polynomial, as <*(x), each of the components a 0 x°, a x x\a 2 x 2 , ... will be called a 
term ; in any term, as atf, a { will be called the coefficient of the term. The terms of a(x) 
and p{x) have been written in a prescribed (but natural) order and we shall continue this 
practice. Then i, the superscript of x, is merely an indicator of the position of the term aw' 
in the polynomial. Likewise, juxtaposition of <ii and x’ in the term aa' is not to be con¬ 
strued as indicating multiplication and the plus signs between terms are to be thought 
of as helpful connectives rather than operators. In fact, we might very well have written 
the polynomial a(x) above as « = (a 0 ,ai,a 2 , ...). 

If in a polynomial, as a(x), the coefficient a n ¥> z while all coefficients of terms which 
follow are z, we say that a(x) is of degree n and call a n its leading coefficient. In particular, 
the polynomial a<>x 0 + zx x + zx 2 + • • • is of degree zero with leading coefficient a<> when 
a 0 ¥= z and has no degree (and no leading coefficient) when a 0 = z. 

Denote by %[x] the set of all polynomials in x over % and, for arbitrary a(x), /3(x) G %[x], 
define addition ( + ) and multiplication (•) on by 

a(x) + fi(x) = (a 0 -I- &o)x° + (ai + bi)x l + (a 2 + b 2 )x 2 + • • • 

= 2(<t* + &*)z fc 

and «(x) • /3(x) = a 0 box° + (aobi + aib 0 )x x + (aob 2 + affii + a 2 b t )x 2 + • • • 

= 2 CkX k , where c k = 2) a t b k -t 

(Note that multiplication of elements of % is indicated here by juxtaposition.) 

The reader may find it helpful to see these definitions written out in full as 
a(x) EE] P(x) - (Oo © bo)x° + (Oi © bi)x x + (a 2 © b 2 )x 2 + • • • 
and a(x) 03 j8(x) = (a 0 O b 0 )x° + (oo O bi © ai O b 0 )x x 

+ (<io ©6 2 © <*i © b\ © a 2 O bt)x 2 + • • • 

in which [T] and CD are the newly defined operations on ^[z], © and O are the binary 
operations on % and, again, + is a connective. 

It is clear that both the sum and product of elements of ^[x] are elements of ^[a:], 
i.e., have only a finite number of terms with non-zero coefficients G %. It is easy to verify 
that addition on ^[a;] is both associative and commutative and that multiplication is asso¬ 
ciative and distributive with respect to addition. Moreover, the zero polynomial 

zx° + zx 1 + zx 2 + ■ ■ ■ — £zx h G ^.[x] 

is the additive identity or zero element of ^[a;] while 

-«(a;) = -oox° + (-ai)x 1 + {-a 2 )x 2 + ■ ■ ■ = 2 ( ~a k )x k G %[x] 

is the additive inverse of a(x). Thus, 

Theorem I. The set of all polynomials % [x] in x over % is a ring with respect to addition 
and multiplication as defined above. 

Let a(x) and /?(x) have respective degrees m and ». If m/ n, the degree of «(x) + /?(x) 
is the larger of m,n; if m = n, the degree of o(x) + p(x) is at most m (why?). The degree 
of a(x) • p(x) is at most m + n since a m b„ may be z. However, if % is free of divisors of zero, 
the degree of the product is m + n. (Whenever convenient we shall follow practice and 
write a polynomial of degree m as consisting of no more than m- 1-1 terms.) 
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Consider now the subset S — {rx°: r G%) of ^[x] consisting of the zero polynomial 
and all polynomials of degree zero. It is easily verified that the mapping 

% -> S: r-* rx° 

is an isomorphism. As a consequence, we may hereafter write a 0 for a 0 x° in any poly¬ 
nomial a(x) £ ^[x]. 


MONIC POLYNOMIALS 

Let % be a ring with unity u. Then u = ux° is the unity of *3([x] since ux° • a(x) — a(x) 
for every «(x) G %[x]. Also, writing x = ux 1 - zx°+ ux\ we have x £ <R[x]. Now 
a k (x • x • x • • • to k factors) = a k x k G %[x] so that in «(x) = a 0 + aix + a 2 x 2 + • • • we may con¬ 
sider the superscript i in a#* as truly an exponent, juxtaposition in any term a t x' as 
(polynomial) ring multiplication, and the connective + as (polynomial) ring addition. 

Any polynomial a(x) of degree m over % with leading coefficient u, the unity of %, will 
be called monic. 

Example 2: (a) The polynomials 1, x + 3, and x 2 -5x + 4 are monic while 2x 2 - x + 5 is not a 

monic polynomial over / (or any ring having / as a subring). 

(6) The polynomials b, bx + f, and bx 2 + dx + e are monic polynomials in S[*] over 
the ring S of Example 1(d), Chapter 11, Page 114. 


DIVISION 

In Problem 1 we prove the first part of 

Theorem II. Let % be a ring with unity u, «(x) = ao + aix + a 2 x 2 + • • +a m x m G %[x] be 
either the zero polynomial or a polynomial of degree m, and /?(x) = 
bo + bix + bix 2 + • • • + ux n G <^[x] be a monic polynomial of degree n. Then 
there exist unique polynomials Q R (x),r R (x)-,q L (x),r L (x)G%[x] with r R (x),r L (x) 
either the zero polynomial or of degree < n such that 

(i) «(x) = q R (x)"fi(x) + r R (x) 

and (ii) «(x) = /3(x) • q L (x) + r L (x) 


In (i) of Theorem II we say that «(x) has been divided on the right by (i(x) to obtain the 
right quotient q R (x) and right remainder r R (x). Similarly, in (ii) we say that a(x) has been 
divided on the left by p{x) to obtain the left quotient q L (x) and left remainder r L (x). 
When r R (x) = z (r L (x) = z), we call p(x) a right (left) divisor of «(x). 

For the special case /3(x) = ux — b — x — b, Theorem II yields (see Problem 2), 


Theorem III. 


The right and left remainders when «(x) is divided by x - b, b G %, are 
respectively 

r = ao + aib + a 2 b 2 + • • • + a n b n 


and 


r L = ao + bai -I- b 2 a 2 + • • • + b n a n 


There follows 

Theorem IV. A polynomial a(x) has x-6 as right (left) divisor if and only if r R = z 
(r L = z). 

Examples illustrating Theorems II-IV when % is non-commutative will be deferred 
until Chapter 15. The remainder of this chapter will be devoted to the study of certain 
polynomial rings ^[x] obtained by further specializing the coefficient ring %. 
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COMMUTATIVE POLYNOMIAL RINGS WITH UNITY 

Let % be a commutative ring with unity. Then <£[*] is a commutative ring with unity 
(what is its unity ?) and Theorems II-IV may be restated without distinction between right 
and left quotients (we replace q R (x) = q L (x) by q(x)), remainders (we replace r R (x) = r L (x) by 
r (x)), and divisors. Thus (i) and (ii) of Theorem II may be replaced by 
(iii) «(*) = q(x)-/3(x) 4- r(x) 

and, in particular, we have 

Theorem IV'. In a commutative polynomial ring with unity, a polynomial «(x) of degree 
m has x - b as divisor if and only if the remainder 

(a) r — a 0 + dib + atb 2 + • • • 4- a m b m = z 

When, as in Theorem IV', r = z then b is called a zero (root) of the polynomial a(x). 

Example 3: (a) The polynomial sc*-4 over I has 2 and -2 as zeros since (2) 2 - 4 = 0 and 

(-2)2-4 = 0. 

(6) The polynomial [3]*2-[4] over the ring 1/(8) has [2] and [6] as zeros while the 
polynomial [l]* 2 -[i] over 1/(8) has [1], [3], [5], [7] as zeros. 

When % is without divisors of zero so also is ^[x]. For, suppose «(x) and /?(x) are 
elements of ^[x], of respective degrees m and n, and that 

a(x)’f}(x) = dobo + (dobi+dlbo)X + ••• + dmb n X m + n = z 

Then each coefficient in the product and, in particular d m b„, is z. But % is without divisors 
of zero; hence a m b„ = z if and only if a m = z or b n = z. Since this contradicts the assump¬ 
tion that a(x) and /3(x) have degrees m and », <^[x] is without divisors of zero. 

There follows 

Theorem V. A polynomial ring <3f[x] is an integral domain if and only if the coefficient 
ring % is an integral domain. 


SUBSTITUTION PROCESS 

An examination of the remainder 

( a ) r = do 4- dib 4- d2 b 2 + • • • + d m b m 

in Theorem IV' shows that it may be obtained mechanically by replacing x by b throughout 
a(x) and, of course, interpreting juxtaposition of elements as indicating multiplication in %. 
Thus, by defining f(b) to mean the expression obtained by substituting b for x throughout 
f(x), we may (and will hereafter) replace r in (a) by «(&). This is, to be sure, the familiar 
substitution process in elementary algebra where (let it be noted) x is considered a variable 
rather than an indeterminate. 

It will be left for the reader to show that the substitution process will not lead to 
future difficulties, that is, to show that for a given b G %, the mapping 

f( x ) _> ' /(&) for all f(x) e ^[x] 

is a homomorphism of ^[x] onto %. 


THE POLYNOMIAL DOMAIN f[x] 

The most important polynomial domains arise when the coefficient ring is a field f. 
We recall that every non-zero element of a field f is a unit of f and restate for the integral 
domain f[x] the principal results of the sections above as follows: 
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The Division Algorithm. If a(x),p(x) G f[x] where p{x) + z, there exist unique poly¬ 
nomials q(x), r(x) with r(x) either the zero polynomial or of degree less than that of p{x). 


such that 


a (x) = q(x) • p(x) + r(x) 

For a proof, see Problem 4. 


When r(x) is the zero polynomial, p(x) is called a divisor of «(x) and we write P(x) \ a(x). 
The Remainder Theorem. If «(x), x - 6 G f[x], the remainder when «(x) is divided by 
x — b is «( 6 ). 

The Factor Theorem. If «(x) G f[x] and bGf, then x - b is a factor of a(x) if and 
only if a(b) = z, that is, x - b is a factor of a(x) if and only if b is 
a zero of a(x). 

There follow 

Theorem VI. Let a(x) G f[x] have degree m> 0 and leading coefficient o. If the dis¬ 
tinct elements 61 , 62 , .. ., 6 m of f are zeros of a(x), then 

a(x) = a(x - 6 i)(x - 62 ) • • • (x - 6 m ) 

For a proof, see Problem 5. 


Theorem VII. Every polynomial <*(x) G y[x] of degree m > 0 has at most m distinct 
zeros in J*. 

Example 4: (a) The polynomial 2x 2 + 7x — 15 S Q[*] has the zeros S/2,—5 , € Q. 

(b) The polynomial x 2 + 2x + 3 € C[x\ has the zeros -1 + V2i and -X-yfii 
over C. However, x 2 + 2x + 3 e Q[z] has no zeros in Q. 

Theorem VIII. Let a{x),p{x) G f[x) be such that «(s) = p{s) for every s € f . Then, if 
the number of elements in f exceeds the degrees of both a(x) and p{x), we 
have necessarily «(x) = p(x). For a proof; see Problem 6 . 

Example 5: It is now clear that the polynomials of Example 1 are distinct, whether considered 
as functions or as forms, since the number of elements of f = 1/(5) does not exceed 
the degree of both polynomials. What then appeared in Example 1 to be a contra¬ 
diction of the reader’s past experience was due, of course, to the fact that his past 
experience had been limited solely to infinite fields. 


PRIME POLYNOMIALS 

It is not difficult to show that the only units of a polynomial domain f[x] are the 
non-zero elements (i.e., the units) of the coefficient ring f. Thus the only associates of 
«(x) G f[x] are the elements v • «(x) of f[x] in which v is any unit of f. 

Since for any v^zGf and any «(x) G f[x], 

«(x) = V~ x -a(x)'V 

while, whenever a(x) = q(x) • p(x), 

«(x) = [v~ x q(x)] • [v p(x)] 

it follows that (a) every unit of f and every associate of «(x) is a divisor of «(x) and 
( 6 ) if p(x) | «(x) so also does every associate of j 8 (x). The units of f and the associates of 
«(x) are called trivial divisors of «(x). Other divisors of «(x), if any, are called non-trivial 
divisors. 

A polynomial «(x) G f{x] of degree m^l is called a prime ( irreducible) polynomial 
over f if its only divisors are trivial. 



CHAP. 12] 


POLYNOMIALS 


129 


Example 6: (a) The polynomial 3* 2 + 2* + 1 6 ft[x] is a prime polynomial over R. 

(b) Every polynomial ax + b G f[x), with a * z, is a prime polynomial over f. 


THE POLYNOMIAL DOMAIN C[x] 

Consider an arbitrary polynomial 

p{x) = b 0 + bix + b 2 x 2 + • • • + b m x m G C[x] 
of degree m - 1. We shall be concerned in this section with a number of elementary 
theorems having to do with the zeros of such polynomials and, in particular, with the 
subset of all polynomials of C[x] whose coefficients are rational numbers. Most of the 
theorems will be found in any College Algebra text stated, however, in terms of roots of 
equations rather than in terms of zeros of polynomials. 

Suppose r EC is a zero of /?(*). Then /3(r) = 0 and, since b~ l G C, also b~ l • p(r) = 0. 
Thus the zeros of p(x) are precisely those of its monic associate 

a ( x ) = b~ l 'P(x) = oo + a x x 4- a 2 x 2 + • • • + a m - x x m ~ l + x m 
Whenever more convenient, we shall deal with monic polynomials. 

It is well-known that when m = 1, a(x) = ao + x has —ao as zero and when m = 2, 
a(x) = ao + dix + x 2 has %{—d x — \/af--4ao) and £(— a x + y/a\ — 4o«) as zeros. In Chapter 8, 
it was shown how to find the n roots of any a G C; thus every polynomial x n — aE C[x] 
has at least n zeros over C. There exist formulas (see Problems 16-19) which yield the 
zeros of all polynomials of degrees 3 and 4. It is also known that no formulas can be 
devised for arbitrary polynomials of degree m ^ 5. 

By Theorem VII any polynomial «(*) of degree m ^ 1 can have no more than m distinct 
zeros. In the paragraph above, <*(*) = a 0 4- a x x + x 2 will have two distinct zeros if and 
only if its discriminant a?-4a„^0. We shall then call each a simple zero of a(x). 
However, if a, - 4a 0 = 0, each formula yields -£ai as a zero. We shall then call -idi 
a zero of multiplicity two of a(x) and exhibit the zeros as -£a,, -£ai. 

Example 7: (a) The polynomial x3 + x 2 -5x + Z = (x - 1)*(* + 3) has -3 as simple zero and 

1 as zero of multiplicity two. 

(b) The polynomial **-*3-3* 2 + 5*-2 = (* - 1)8 (* + 2) has -2 as simple zero 
and 1 as zero of multiplicity three. 

The so-called 

Fundamental Theorem of Algebra. Every polynomial «(x) G C[x] of degree m ^ 1 has 
at least one zero in C. 

will be assumed here as a postulate. There follows, by induction 

Theorem IX. Every polynomial a(x) G C[x] of degree m ^ 1 has precisely m zeros over C, 
with the understanding that any zero of multiplicity n is to be counted as 
n of the m zeros. 

and, hence, 

Theorem X. Any a(x) G C[x] of degree m^l is either of the first degree or may be 
written as the product of polynomials G C[x] each of the first degree. 

Except for the special cases noted above, the problem of finding the zeros of a given 
polynomial is a difficult one and will not be considered here. In the remainder of this 
section we shall limit our attention to certain subsets of C[x ] obtained by restricting the 
ring of coefficients. 




130 


POLYNOMIALS 


[CHAP. 12 


First, let us suppose that 

a(x) = ao + aix + a 2 x 2 + ■■ ■ + OmX m G #[x] 
of degree m ^ 1 has r = a + bi as zero, i.e., 

«(r) = o 0 + air + a 2 r 2 + • • • + a m r m = s + ti = 0 

By Problem 2, Chapter 8, Page 79, we have 

a(f) = a 0 + aif + aar 2 +-1- a m f m = s + ti - 0 

so that 

Theorem XI. If r G C is a zero of any polynomial «(x) with real coefficients, then f is 
also a zero of a(x). 


Let r = a + bi, with b * 0, be a zero of «(x). By Theorem XI f = a-bi is also a zero 
and we may write 

«(x) = [x - (« + 6t)] [x-(a- 6*)] • ^(x) 

= [x 2 - 2ax + a 2 + b 2 ] • «,(x) 


where a,(x) is a polynomial of degree two less than that of «(x) and has real coefficients. 
Since a quadratic polynomial with real coefficients will have imaginary zeros if and only if 
its discriminant is negative, we have 

Theorem XII. The polynomials of the first degree and the quadratic polynomials with 
negative discriminant are the only polynomials G /Z[x] which are primes 
over R. 


and 

Theorem XIII. A polynomial of odd degree G R[x] necessarily has a real zero. 

Suppose next that 

p(x) = 6o + bix + biX 2 + • • • + b m x m G Q[x] 

Let c be the greatest common divisor of the numerators of the b’s and d be the least 
common multiple of the denominators of the b’s; then 

a (x) = ^ • P(x) = do + dix + diZC 2 + ■ ■ • + dmX m G Q[«] 

has integral coefficients whose only common divisors are ±1, the units of I. Moreover, 
p(x) and a(x) have precisely the same zeros. 

If r G Q is a zero of a(x), i.e. if 

«(r) = ao + a t r + a 2 r 2 + • • • + a m r m = 0 

there follow readily 

(i) if r G /, then r | a 0 

(ii) if r = s/t, a common fraction in lowest terms, then 

t m • a(s/t) = dot m + Oi St” 1 - 1 + a 2 s 2 t m - 2 + • • • + a»—, s m ~H + d m s m = 0 
so that s | a 0 and t \ d m . We have proved 

Theorem XIV. Let «(*) = a 0 + aix + a 2 x 2 + ■ • • + a m x m 

be a polynomial of degree m ^ 1 having integral coefficients. If s/t G Q, 
with ( s, t) = 1, is a zero of a(x), then s j ao and t \ a m . 
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Example 8: (a) The possible rational zeros of 

a(x) = Zx 3 + 2x 2 - 7x + 2 

are ±1, ±2, ±|. Now «(1) = 0, a(-l) # 0, <*(2) ^ 0, «(-2) = 0, a(£) = 0, 

<*(--J) ** 0, <*(§) ^ 0, <*(—§) #0 so that the rational zeros are 1, —2, J and 
a(x) = 3(x-l)(x + 2)(x-£). 

Note. By Theorem VII, a(x) can have no more than three distinct zeros. 
Thus once these have been found, all other possibilities untested may be dis¬ 
carded. It was not necessary here to test the possibilities —£, §, —§. 

(b) The possible rational zeros of 

a(x) = 4x 5 - 4x* — 5x 3 + 5x 2 + x — 1 
are ±1, ±$, ±£. Now a(l) = 0, «(-l) = 0, «(£) = 0, a(-^) = 0 so that 
a(x) = 4(x-l)(x+l)(x-£)(x + £)-(x-l) 
and the rational zeros are 1,1, —1, 

(c) The possible rational zeros of 

a(x) = x* — 2x* — 5x 2 + 4x + 6 

are ±1, ±2, ±3, ±6. For these, only a(-l) = 0 and a(3) = 0 so that 
a(x) = (x + l)(x — 3)(x 2 — 2) 

Since none of the possible zeros ±1, ±2 of x 2 — 2 are zeros, it follows that 
* 2 — 2 is a prime polynomial over Q and the only rational zeros of a(x) are —1,3. 

(d) Of the possible rational zeros: ±1, ±£, of a(x) = 6x 4 — 5x® + 7x 2 — 5x + 1 

only £ and \ are zeros. Then a(x) = 6(x - £)(x - £)(x 2 + 1) so that x 2 + 1 
is a prime polynomial over Q, and the rational zeros of a(x) are -J, 

(e) The possible rational zeros of 

a(x) = Zx* — 6x® + 4x 2 — lOx + 2 

are —2, — Since none, in fact, is a zero, a(x) is a prime polynomial 

over Q. 

GREATEST COMMON DIVISOR 

Let «(x) and 0(x) be non-zero polynomials in f[x]. The polynomial d(x) e f[x] having 
the properties 

(1) d(x) is monic, 

(2) d(x) | a(x) and d(x) | /3(x), 

(3) for every c(x) € f[x] such that c(x) | «(x) and c(x) | £(x), we have c(x) | d(x), 
is called the greatest common divisor of «(x) and p{x). 

It is evident (see Problem 7) that the greatest common divisor of two polynomials in 
T[ x \ can be found in the same manner as the greatest common divisor of two integers in 
Chapter 5. For the sake of variety, we prove in Problem 8 

Theorem XV. Let the non-zero polynomials «(x) and £(x) be in f[x]. The monic poly¬ 
nomial 

( b) d(x) = s(x) • «(x) + t(x) • /2(x), s(x), t(x) e f[x] 

of least degree is the greatest common divisor of o(x) and /J(x). 

There follow 

Theorem XVI. Let «(x) of degree m^2 and 0(x) of degree n^2 be in f[x\. Then non¬ 
zero polynomials fi(x) of degree at most n -1 and v(x) of degree at most 
m — 1 exist in f'[x\ such that 

(c) M (x) • «(x) + v(x) • /?(x) = z 
if and only if «(x) and /J(x) are not relatively prime. 

For a proof, see Problem 9. 
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and 

Theorem XVII. If o(x), p(x), p(x) G f[x\ with a(x) and p(x) relatively prime, then 
p(x)\a(x)‘P(x) implies p(x)\p(x). 

In Problem 10, we prove 

The Unique Factorization Theorem. Any polynomial «(x), of degree m — 1 and with 
leading coefficient a, in f\x] can be written as 

«(*) = a • [p t (a;)] m ‘ • [p 2 {x)} m * • • • [p/x)]’"' 

where the p t (x) are monic prime polynomials over f 
and the m i are positive integers. Moreover, except for 
the order of the factors, the factorization is unique. 

Example 9: Decompose a(x) = 4x* + Zx 3 + 4x 2 + 4* + 6 over 7/(7) into a product of prime 
polynomials. 

We have, with the understanding that all coefficients are residue classes 
modulo 7, 

a(x) = 4x 4 + 3x 3 + 4*2 + 4x + 6 = 4x* + 24x 3 + 4x 2 + 4x + 20 

= 4(x 4 + 6*® + x 2 + x + 5) = 4(x + l)(x* + 5x 2 + 3* + 5) 

= 4(x + l)(x + 3)(x 2 + 2x + 4) = 4(x + l)(x + 3)(* + 3)(x + 6) 

= 4(x + l)(x + 3) 2 (* + 6) 


PROPERTIES OF THE POLYNOMIAL DOMAIN f[x] 

The ring of polynomials f[x) over a field f has a number of properties which parallel 
those of the ring I of integers. For example, each has prime elements, each is a Euclidean 
ring, (see Problem 11), and each is a principal ideal ring (see Theorem IX, Chapter 10, 
Page 108). Moreover, and this will be our primary concern here, f[x] may be partitioned 
by any polynomial A(*) G f[x] of degree n ^ 1 into a ring 

0 ) = {[«(*)], [/*(*)],...} 

of equivalence classes just as / was partitioned into the ring 7/(m). For any a(x),p(x) G f[ x \ 
we define 

(i) [«(*)] = {«(*) + m(*) ' M*): *(*) e T [*]> 

Then a(a;) G [«(*)], since the zero element of f is also an element of T[%\, and [«(*)] = [^(*)] 
if and only if a(x) = p(x) (mod A.(x)), i.e. if and only if A(x) | («(x) - p(x)). 

We now define addition and multiplication on these equivalence classes by 

[«(*)] + [p(x)] = [«(*) + p(x)} 

and [«(x)] • [p(x)} = \*{x) • p(x)} 

respectively and leave for the reader to prove 

(а) Addition and multiplication are well-defined operations on /(*(*)). 

(б) f[x]/(\(x)) has [z] as zero element and [u] as unity, where z and u respectively are 
the zero and unity of f. 

(c) y[x]/(\(x)) is a commutative ring with unity. 

In Problem 12, we prove 

Theorem XVIII. The ring ^'[xj/^x)) contains a subring which is isomorphic to the 
field f. 
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If A(x) is of degree 1, it is clear that f[x]l{k(x)) is the field f\ if A(x) is of degree 2, 
T[x]/{k{x)) consists of f together with all equivalence classes {[a 0 + aix]: a 0 , ai G f, ai^z}; 
in general, if A(x) is of degree n, we have 

y[*]/(M*)) — {[«0 + dix + 02X 2 + • • • + dn-ia;" -1 ] : a* E 

Now the definitions of addition and multiplication on equivalence classes and the isomor¬ 
phism: a t <-> [Of] imply 

[a 0 + dix + d 2 x 2 + • • • + a«-iX B_1 ] = [oo] + [<ii][x] + [a 2 ] • [x ] 2 + • ■ • + [a„-i] • [x] n_1 
= do + di[x ] + «2[a:] 2 + • • • + dn-\ [x] B-1 
As a final simplification, let [x] be replaced by | so that we have 

jP[x]/(A(x)) = {a 0 + di£ + a 2 | 2 + • • • + a„-i £ B_1 : di €E J'j 

In Problem 13, we prove 

Theorem XIX. The ring y[*]/(A(x)) is a field if and only if A(x) is a prime polynomial 
over f. 

Example 10: Consider \(x) = * 2 —3 G <?[*], a prime polynomial over Q. Now 
Q[x]/(x 2 - 3) = {a 0 + OjJ : a 0 , Oj £ Q) 

is a field with respect to addition and multiplication defined as usual except that in 
multiplication i 2 is to be replaced by 3. It is easy to show that the mapping 

a 0 + o,{ ++ a 0 + ai\/3 

is an isomorphism of Q[*]/(* 2 — 3) onto 

Q[V3] = {ao + «,V3: Oo.cqeQ), 

the set of all polynomials in y/3 over Q. Clearly Q[\/3] C R so that Q[\/3] is the 
smallest field in which x 2 — 3 factors completely. 

The polynomial x 2 —3 of Example 10 is the monic polynomial over Q of least degree 
having \/3asa root. Being unique, it is called the minimum polynomial of y/Z over Q. 
Note that the minimum polynomial of \/S over R is x - y/Z. 

Example 11: Let T = 1/(3) = {0,1,2} and take X(#) = * 2 + l, a prime polynomial over Con¬ 
struct the addition and multiplication table for the field 7'[*]/(X(*)). 

Here y[*]/(\(*)) = {o 0 + : a 0 . “l ^ T) 

= { 0 , 1,2, {, 2£, 1 + {, 1 + 2 £, 2 + {, 2 + 2j) 

Since X({) = ( 2 + 1 = [0], we have | 2 = [-1] = [2], or 2. The required tables are: 


+ 

0 

1 

2 

£ 

24 

1 + 4 

1 + 24 

2 + 4 

2 + 24 

0 

0 

1 

2 

£ 

2£ 

1 + 4 

1 + 24 

2 + 4 

2 + 24 

1 

1 

2 

0 

1 + 4 

1 + 24 

2 + 4 

2 + 24 

£ 

24 

2 

2 

0 

1 

2 + 4 

2 + 24 

4 

24 

1 + 4 

1 + 24 

£ 

£ 

l + £ 

2 + 4 

2£ 

0 

1 + 24 

1 

2 + 24 

2 

2£ 

2£ 

1 + 24 

2 + 24 

0 

4 

1 

1 + 4 

2 

2 + 4 

1 + £ 

1 + £ 

2 + 4 

£ 

1 + 24 

1 

2 + 24 

2 

24 

0 

1 + 2$ 

1 + 24 

2 + 24 

24 

1 

1 + 4 

2 

2 + 4 

0 

4 

2 + £ 

2 + 4 

£ 

1 + 4 

2 + 24 

2 

24 

0 

1 + 24 

1 

2 + 2J 

2 + 24 

2£ 

1 + 24 

2 

2 + 4 

0 

4 

1 

1 + 4 


Table 12-1 
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0 

1 

2 

4 

24 

1 + 4 

1 + 24 

2 + 4 

2 + 24 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

2 

4 

24 

1 + 4 

1 + 24 

2 + 4 

2 + 24 

2 

0 

2 

1 

24 

4 

2 + 24 

2 + 4 

1 + 24 

1 + 4 

4 

0 

4 

24 

2 

1 

2 + 4 

1 + 4 

2 + 24 

I + 24 

24 

0 

24 

4 

1 

2 

1 + 24 

2 + 24 

1 + 4 

2 + 4 

1 + 4 

0 

1 + 4 

2 + 24 

2 + 4 

1 + 24 

24 

2 

1 

4 

I + 24 

0 

I + 24 

2 + 4 

1 + 4 

2 + 24 

2 

4 

24 

1 

2 + 4 

0 

2 + 4 

1 + 24 

2 + 24 

1 + 4 

1 

24 

4 

2 

2 + 24 

0 

2 + 24 

1 + 4 

1 + 24 2 +4 

Table 12-2 

4 

1 

2 

24 


Example 12: Let T-Q and take X(*) = * 3 + a: + l, a prime polynomial over f. Find the mul¬ 
tiplicative inverse of { 2 + { + 16 T [*] / (*(*))• 

Here y[x]/(\(*)) = (oo + + «s4 2 : o 0 .«iand, since X(4) = 4 S + 

{ + 1 = 0, we have {» = — 1 — { and f 4 = —4 — £ 2 . One procedure for finding the 
required inverse is: 

set («o + «i4 + «24 2 )(1 + 4 + 4 2 ) = 1> 
multiply out and substitute for 4 3 and { 4 , 
equate the corresponding coefficients of 4°, 4,4 2 
and solve for a^a^oj. 

This usually proves more tedious than to follow the proof of the existence of the 
inverse in Problem 13. Thus, using the division algorithm, we find 
1 = £(4* + 4 + l)(l-4) + i(4 2 + 4 + l)(€ 2 -24 + 2) 

Then (4 s + f + 1) I [1 - £(4 2 + 4 + 1)<4 2 -24 + 2)] 

so that i(4 2 + 4 + DU 2 - 24 + 2) = 1 

and |(4 2 — 24 + 2) is the required inverse. 

Example 13: Show that the field R[x]/(x 2 + 1) is isomorphic to C. 

We have R[x]/(x 2 + 1) = {a 0 + arf : Oo,<*i € R}. Since 4 2 = -1, the mapping 
<*o + «i4 ** a o + 

is an isomorphism of R[x\!(x 2 +1) onto C. We have then a second method of con¬ 
structing the field of complex numbers from the field of real numbers. It is, how¬ 
ever, not possible to use such a procedure to construct the field of real numbers 
from the rationals. 

Example 13 illustrates 

Theorem XX. If a(x) of degree m — 2 is an element in f[x), then there exists a field f, 
where f C f, in which «(x) has a zero. For proof) gee Problem 14 . 

It is to be noted that over the field f of Theorem XX, «(x) may or may not be written 
as the product of m factors each of degree one. However, if a(x) does not factor completely 
over f, it has a prime factor of degree » — 2 which may be used to obtain a field f", 
with f C f C f", in which «(x) has another zero. Since a(x) has only a finite number of 
zeros, the procedure can always be repeated a sufficient number of times to ultimately 
produce a field J’ l * ) in which a(x) factors completely. g ee p rob i em 15 



CHAP. 12] 


POLYNOMIALS 


135 


Solved Problems 

1. Prove: Let %be a ring with unity u; let 

a(x) - a 0 + aix + a 2 x 2 + ■•• + OmX m G %[x] 
be either the zero polynomial or of degree m\ and let 

P( x ) = b 0 + bix + b 2 x 2 + • • • + ux* G *[*] 
be a monic polynomial of degree n. Then there exist unique polynomials q R (x),r R (x) G ^[a:] 
with r R (x) either the zero polynomial or of degree < n such that 

(i) «(*) = q R (x)-fi(x) + r R (x) 

If a(x) is the zero polynomial or if n> m, then (i) holds with q R (x) = z and r R (x) = a(x). 

Let it — m. The theorem is again trivial if m = 0 or if m = 1 and to = 0. For the case 
m = w = 1» take a(x) — Oo + a t x and fi(x) — b 0 + ux. Then 

a(x) = a 0 + a,x = o 1 (6 0 + ux) + (a 0 — aj6 0 ) 

and the theorem is true with q R (x) = a, and r R (x) = « 0 - o,6 0 . 

We shall now use the induction principle of Problem 27, Chapter 3, Page 37. For this purpose 
we assume the theorem true for all a(x) of degree =£ m-1 and consider a(x) of degree m. Now 
y(x) - a(x) — a m x n m • P(x) € ^[k] and has degree < m. By assumption, 
y(*) = S(x) • fi(x) + r{x) 

with r(x) of degree at most w — 1. Then 

a(x) = y(*) + = («(*) + On,*"-™) • p(x) + r(x) 

= «*(*)•£<*) + r R (x) 

where q R (x) = S(x) + a m x »-« and r R (x) = r(x), as required. 

To prove uniqueness, suppose 

"(*) = q R (*) •/?(*) + r R (x) = q' R (x) • p(x) + r R (x) 

Then (««(*)-?;(*)) •iSl*) = (x) — r R (x) 

Now r R (x) — r R (x) has degree at most to— 1 while, unless q R (x) — q' R (x) = z, (q R (x) — q’ R (x)) • fi(x) 
has degree at least to. Thus q R (x) - q ' R (x) = z and then r' R (x) - r R (x) = z, which establishes 
uniqueness. 

2. Prove: The right remainder when <*(a;) = a 0 + a x x + azx 2 + ■■■ + a m x m G %[x] is divided 
by x - b, b G %, is r R = a 0 + aib + a 2 b 2 + • • • + a m b m . 

Consider 

a(x) - r R = a t (x - 6) + o^x 2 - 6 2 ) + ••• + a m (x”* - 6"») 

= {oi + 02(x + b) + ■ • • + a OT (x m- i + bx”>-2 + ... + 6m-1)} . ( x -b) 

= q R (x)'(x-b) 

Then «(*) = q R (x) '(x-b) + r R 

By Problem 1 the right remainder is unique; hence r„ is the required right remainder. 

3. In the polynomial ring S[a;] over S, the ring of Example 1(d), Chapter 11, Page 114, 

(а) Find the remainder when «(*) = cx* + dx 3 + cx 2 + hx + g is divided by B(x) = 
bx 2 + fx + d. 

(б) Verify that / is a zero of y(x) = cx* + dx 3 + cx 2 + bx + t d. 
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(a) We proceed as in ordinary division, with one 
variation. Since S is of characteristic two, 
s + (— t) = 8 + t for all «,l£S; hence, in the 
step “change the signs and add” we shall 
simply add. 

The remainder is r(x) = gx + f. 

(b) Here f 2 = c, f 3 = ef = e, and / 4 = h. Then 

y(f) = ch + de + c 2 + bf + d 
= d+e+fc + / + d = a 

as required. 


cx 2 + hx + b _ 

bx 2 + fx + d ) ex A + dx 3 + cx 2 + hx + g 
cx 4 + ex 3 + fx 2 


hx 3 + hx 2 + hx 
hx 3 + gx 2 + ex 

bx 2 + dx + g 
bx 2 + fx + d 


9* + f 


4. Prove the Division Algorithm as stated for the polynomial domain f{x\. 

Note. The requirement that fi(x) be monic in Theorem II, Page 126, and its restatement for 
commutative rings with unity was necessary. 

Suppose now that a(x),/3(x) S f[x] with b n ^u the leading coefficient of /Hx). Then, since b n 1 
always exists in f and p'(x) = b n 1 • /Hx) is monic, we may write 

a(x) = q'(x)'P'(x) + r(x) 

= [b ^" 1 qr'( 3C )] ’ [&n' P’( x )} + r ( x ) 

= q(x) • P(x) + r(x) 

with r(x) either the zero polynomial or of degree less than that of p(x). 


5. Prove: Let «(*) €E f[z\ have degree m > 0 and leading coefficient a. If the distinct 
elements b u b 2 ,..., b m of f are zeros of «(*), then «(x) - a(x - bi)(x b 2 ) ■ ■ • (x b m ). 

Suppose m = 1 so that a(x) = ax + a, has, say, 6, as zero. Then «(<>i) = = *« ®i = ~ ab " 

and a(x) = ax += ax - ab l = a(x - b t ) 


The theorem is true for m — 1. 

Now assume the theorem to be true for m = k and consider a<x) of degree k + 1 with zeros 
b 2 . b k + l . Since ^ is a zero of a(x), we have by the Factor Theorem 

a(x) = q(x) • (* - i>i) 


where q(x) is of degree k with leading coefficient a. 

j = 2,3. k +1 and since bj — b 1 ¥* z for all j ■»* 1, 

distinct zeros of q(x). By assumption, 


Since a(bj) — q(bj) • (bj — b t ) — 
it follows that b 2 ,b 3 . b k + l 


q(x ) = a(x — b 2 )(x — b 3 ) ■ ■ ■ (x - b k + 1 ) 


z for 
are k 


Then a ( x ) = ®(* ~ f>i)(x - b 2 ) ■ ■ ■ (x - b k + l ) 

and the proof by induction is complete. 


Prove- Let «(*), p(x) G f [*] be such that «(s) = P(s) for every sGf. Then, if the 
number of elements of f exceeds the degree of both «(x) and 0{x), we have necessarily 
a{x) = P(X). 

Set ylx) = a(x) - P(x). Now y(x) is either the zero polynomial or is of degree p which certamiy 
does not exceed the greater of the degrees of a(x) and /»(*). By hypothesis, yW - »(.) 
every s G f. Then y(x) = z (otherwise y(x) would have more zeros than its degree, contrary to 
Theorem VII, Page 128) % nd «(*) = P(x) as required. 
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7. Find the greatest common divisor of a(x) = 6a; 5 + 7a; 4 - 5a; 3 - 2a; 2 - x + 1 and /?(x) = 
6x 4 - 5x 3 — 19x 2 — 13x — 5 over Q and express it in the form 

d(x) = s(x) • a(x) + t(x)-i 8(x) 


Proceeding as in the corresponding problem with integers, we find 

a(x) = (x + 2) • p(x) + (24X 2 + 49x 2 + 30x + 11) _ qi ( x )'/}(x) + n(x) 

P( x ) ~ ^(8x — 23)*rj(x) + || (3x 2 + 2x + 1 ) = q 2 (x) • rj(x) + r 2 (x) 

ri(x) = ^(256x + 352)*r 2 (x) 

Since r 2 (x) is not monic, it is an associate of the required greatest common divisor d(x) = x 2 + §x + 




r 2 (x) = fi(x) - q 2 (x) • rj(x) 

= P(x) — q 2 (x) • «(x) + q x (x) • g 2 (x) • fi(x) 

= — V2O*) • «(x) + (1 + q t (x) • q 2 (x)) • ;3(x) 

= - ^ (8x - 23) • <r(x) + ^ (8x 2 - 7x - 14) • /?(x) 


and 


d(x) = 2 ^ r2(x) = 


"279 ( 8*- 23) • «(x) + 279 (8* 2 - 7x - 14) • /?(x) 


8. Prove: Let the non-zero polynomials «(x) and /9(x) be in T'fx], The monic polynomial 
d(x) = s«(x)*«(x) + f„(x)-0(x), s 0 (x),t 0 (a;) G f[x] 

of least degree is the greatest common divisor of a(x) and /?(x). 

Consider the set 

S = («(x) • a(x) + t(x) • f)(x) : «(x), t(x) € f[x]} 

Clearly this is a non-empty subset of y[x] and, hence, contains a non-zero polynomial S(x) of least 
degree. For any 6(x) £ 5, we have, by the Division Algorithm, b(x) = q(x) • 8(x) + r(x) where 
r(x) £ S (prove this) is either the zero polynomial or has degree less than that of 8(x). Then r(x) = z 
and b(x) = q(x) • 8(x) so that every element of S is a multiple of 8(x). Hence, S(x) | a(x) and 
8(x) \P(x).' Moreover, since 8(x) = * 0 (x) • a(x) + t 0 (x) • p(x), any common divisor c(x) of a(x) and 
P(x) is a divisor of S(x). Now if 8(x) is monic, it is the greatest common divisor d(x) of «(x) and p(x); 
otherwise, there exists a unit v such that v • 8(x) is monic and d(x) - v • 8(x) is the required greatest 
common divisor. 


9. Prove: Let a(x) of degree m^2 and /?(x) of degree n^2 be in f[x]. Then non-zero 
polynomials M (x) of degree at most n- 1 and v(x) of degree at most m - 1 exist in f[x] 
such that 

(c) /*(*)'«(x) + v(x) • p{x) = z 

if and only if a(x) and /3(x) are not relatively prime. 

Suppose 8(x) of degree p - 1 is the greatest common divisor of o(x) and p(x), and write 
a(x) = « 0 (x) • 8(x) , p(x) = p 0 (x) • 8(x) 

Clearly a 0 (x) has degree — m —1 and Po(x) has degree — n—1. Moreover, 

p„(x) • a(x) = ^o(x) • « 0 (x) • 8(x) = a 0 (x) • [0 o (x) • 8(x)] = « 0 (x) • /3(x) 
so that /?<,(*). «(*) + [-„„(*) . £(*)] = z 

and we have (c) with n(x) = /3 0 (x) and »(x) = — <* 0 (x). 
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Conversely, suppose ( e ) holds with a(x) and p(x) relatively prime. By Theorem XV, Page 131, 
we have 

u — s(x) • a(x) + t(x) • p(x) for some *(*), t(x) G f\x\ 

Then n(x) = n(x) • s(x) • «(*) + /t(x) • t(x) • p(x) 

= s(x)[-,(x)-0(x)] + n(x) • t(x) • P(x) 

= P(x) [/»(*) • t(x) - ,(x) • s(x)] 

and p(x) | /»(*). But this is impossible; hence (c) does not hold if a(x) and p(x) are relatively prime. 


10. Prove; The unique factorization theorem holds in T[x]. 

Consider an arbitrary <*(x) € f[x]. If a(x) is a prime polynomial, the theorem is trivial. If a(x) 
is reducible, write a(x) = a • p(x) • y(x) where p(x) and y(x) are monic polynomials of positive degree 
less than that of a(x). Now either p(x) and y(x) are prime polynomials, as required in the theorem, 
or one or both are reducible and may be written as the product of two monic polynomials. If all factors 
are primes, we have the theorem; otherwise, .... This process cannot be continued indefinitely 
(for example, in the extreme case we would obtain a(x) as the product of m polynomials each of 
degree one). The proof of uniqueness is left for the reader, who also may wish to use the induction 
procedure of Problem 27, Chapter 3, Page 37, in the first part of the proof. 


11. Prove: The polynomial ring f[x) over the field f is a Euclidean ring. 

For each non-zero polynomial a(x) G y[x], define 8(a) = m where m is the degree of a(x). 
If a(x), P(x) G f[x] have respective degrees m and n, it follows that 8(a) = m, 8(p) = n, 8(a ‘ P) = 
m + n and, hence, «(«• p) ^ 8(a). Now we have already established the division algorithm: 

a(x) = q(x) • p(x) + r(x) 

where r(x) is either * or of degree less than that of P(x). Thus, either r(x) = z or e(r) < e(P) as 
required. 


12. Prove: The ring H*]/(*(*)) contains a subring which is isomorphic to the field f. 

Let a, b be distinct elements of f ; then [*], [6] are distinct elements of f[x]/(x(x)) since [a] = [6] 
if and only if \(x) \ (a - b). 

Now the mapping a -» [a] is an isomorphism of f onto a subset of f[x] / (M*)) since it is 
one-to-one and the operations of addition and multiplication are preserved. It will be left for the 
reader to show that this subset is a subring of y[x]/(x(x)). 


13. Prove: The ring f [x]/(a(*)) is a field if and only if k(x) is a prime polynomial over f. 

Suppose X(x) is a prime polynomial over f. Then for any [a(x)] [z] of F[x] / (x(x)), we have 

by Theorem XV, Page 131, 

u = a(x) • p(x) + X(x) • y(x) for some p(x),y(x) G f[x] 

Now X(x) I u - a(x )• p(x) so that [«(x)] • [/3(x)] = [«]. Hence, every non-zero element [a(x)] G f[x]/(x(x)) 
has a multiplicative inverse and y[x]/(x(x)) is a field. 

Suppose X(x) of degree m - 2 is not a prime polynomial over T, i.e. suppose X(x) = n(x) • v(x) 
where n(x), »(x) G f[x\ have positive degrees s and t such that s 4 -1 = m. Then i<«t so that 
X(x) fdx) and [m(x)] ^ [*]; similarly, [»(*)] ^ [z]. But [p(x)] • [r(*)] = [/.(*) * »(*)] = [ x (*)] = W- 
Thus, since [/»(*)], ['(*)] it follows that f[x]/(x(x)) has divisors of zero and is not 
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14. Prove: If a(x) of degree m — 2 is an element of T[x], there exists a field f, where 
T c T'> in which a(x) has a zero. 

The theorem is trivial if a(x) has a zero in f; suppose that it does not. Then there exists a monic 
prime polynomial X(x) G y[x] of degree n — 2 such that X(x) \ a(x). Since X(x) is prime over f, 
define f = f[*]/ (x(x)). 

Now by Theorem XVIII, Page 132, f c f so that a(x) G y'[x]. Also, there exists £Gf such 
that X(£) = [*]. Thus £ is a zero of <*(x) and f is a field which meets the requirement of the theorem. 


15. Find a field in which x 3 — 3 G Q[*] (a) has a factor, (6) factors completely. 

Consider the field Q[x]/(x® - 3) = (c^ + a,£ + a 2 | 2 : a^,a x ,a 2 G Q} 

(a) The field just defined is isomorphic to 

f = {®o + <*iV3 + a 2 y/3 : a 0 ,a 1 ,a 2 G Q} 

in which x 3 — 3 has a zero. 

(b) Since the zeros of x 3 — 3 are y/H, y/3 a, y/3 u 2 , it is clear that x 3 — 3 factors completely in 

r = rn. 


16. Derive formulas for the zeros of the cubic polynomial «(x) = Oo + a\X + a 2 x 2 + x 3 over 
C when a 0 ¥■ 0. 

The derivation rests basically on two changes to new variables: 

(i) If = 0, use x = y and proceed as in (ii) below; if o 2 ¥• 0, use x = y + v and choose v so 
that the resulting cubic lacks the term in y 2 . Since the coefficient of this term is o 2 + 3v, the 
proper relation is x = y — a 2 /3. Let the resulting polynomial be 

P(v) = a(v - <*2/3) = « + PV + V 3 

If q = 0, the zeros of fHy) are 0, y/—p, — y/—p and the zeros of a(x) are obtained by decreasing 
each zero of p(y) by 02/3. If <7 # 0 but p = 0, the zeros of p(y) are the three cube roots p,up, u 2 p 
(see Chapter 8) of — q from which the zeros of «(x) are obtained as before. For the case pq 0, 

(ii) Use y — z — p/Pz to obtain 

y(z) = p(z — p/3z) = z 3 + q — p 3 / 27z* = ** ~ P s /27 

z 3 

Now any zero, say s, of the polynomial 8(z) = z* + qz 3 - p 3 /27 yields the zero s - p/3s - a 2 /3 
of a(x); the six zeros of S(z) yield, as can be shown, each zero of <*(x) twice. Write 

S(z) = [z 3 + $(q - y/q 2 + 4p3/27)] • [z 3 4- $(q + y/q 2 + 4p 3 /27)] 

and denote the zeros of z 3 + ^(<7 - y/q 2 + 4p*/27) by A,aA,u 2 A. The zeros of «(x) are then: 
A — p/3A — 02/3, uA — <o 2 p/3 A — a*J3, and u?A — up/3 A — a 2 /3. 

17. Find the zeros of a(x) — —11 - 3x + Sx 2 + x 3 . 

The substitution x = y — 1 yields 

P(v) = a(y- 1) = -6 -6y + y 3 
In turn, the substitution y = z + 2/z yields 

y(z) = p(z + 2/z) = z 3 + 8/z 3 — 6 = * - 6g3 + 8 = ^ ~ 2 >( z3 ~ 4) 

3 Z 3 2? 

Take A = y/2; then the zeros of a(x) are: 

y/z + yfi — 1, wv^ + w 2 ^-!, co 2 \^2 + <0 yfl - 1 
The reader will now show that, apart from order, these zeros are obtained by taking A = \/4. 
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18. Derive a procedure for finding the zeros of the quartic polynomial 

a(x) = a 0 + dix + chx 2 + asx 3 + x 4 G C[x], when a 0 ¥* 0. 

If a 3 # 0, use x — y — a 3 /4 to obtain 

P(y) = a(p — 03/4) = b 0 + + b 2 y 2 + v 4 

Now p(y) = (p + 2 qy + y 2 )(r — 2qj/ + y 2 ) 

= pr + 2 q(r-p)y + (r + p-4 q 2 )y 2 + y 4 
provided there exist p,q,r G C satisfying 

pr = b 0 , 2 q(r — p) = b lt r + p — 4 q 2 = b 2 

If 61 = 0, take q = 0; otherwise, with q # 0, we find 

2p = b 2 + 4<? 2 — bJ2q and 2r — b 2 + 4 q 2 + b t /2q 

Since 2p • 2r = 46 0 , we have 

(i) 64g® + 326 2 9 4 + 4 (b\- 4b 0 )q 2 - b\ = 0 

Thus, considering the left member of (i) as a cubic polynomial in q 2 , any of its zeros different from 0 
will yield the required factorization. Then, having the four zeros of p(y), the zeros of a(x) are obtained 
by decreasing each by a 3 /4. 

19. Find the zeros of «(») = 35 - 16x - 4r* + x 4 . 

Using X = y + 1, we obtain 

P(y) = a(y + 1) = 16 - 24 y - 6 y 2 + y* 

Here, (i) of Problem 18 becomes 

64q« - 192 q 4 - 112 q 2 - 676 = 16(q 2 - 4)(4q 4 + 4q 2 + 9) = 0 
Take 9 = 2; then p = 8 and r = 2 so that 

16 - 24y -6 y 2 + y* = (8 + 4y + y 2 )(2 -4 y + y 2 ) 
with zeros -2 ± 2i and 2 ± yfi. The zeros of a(x) are: ~1 * 2 i and 3 ± a/2. 


Supplementary Problems 

20. Give an example of two polynomials in x of degree 3 with integral coefficients whose sum is of degree 2. 

21. Find the sum and product of each pair of polynomials over the indicated coefficient ring. (For con¬ 
venience, [a], [6], ... have been replaced by a,b,... .) 

(а) 4 + x + 2x 2 , 1 + 2x + 3x 2 ; //(6) 

( б ) 1 + 5x + 2x 2 , 7 + 2x + 3x 2 + 4X 3 ; 1/(8) 

(c) 2 + 2x + x 3 , 1 + x + x 2 + x*; 1/(3) 

Ana. (a) 3x; 4 + 4x + x 2 + 2x 3 + x 4 

(e) x 2 + x 3 + x 4 ; 2 + x + x 2 + x 2 
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22. In the polynomial ring S[x] over S, the ring of Problem 2, Chapter 10, Page 108, verify: 

(а) (b + gx + fx 2 ) + (d + gx) = e + ex + fx 2 

(б) (b + gx + fx 2 )(d + ex) = b + hx + cx 2 + bx 2 
(e) (b + gx + fx 2 )(d + ex) = b + ex + bx 2 

(d) f + bx and e + ex are divisors of zero. 

(e) c is a zero of f + ex + fx 2 + ex* 4- dx 4 . 

23. Given a(x),fi(x), y(x) G J[x] with respective leading coefficients a,b,e and suppose a(x) = /3(x) • y(x). 
Show that a(x) = a • p'(x) • y'(x) where j8'(x) and y’(x) are monic polynomials. 

24. Show that ,D[x\ is not a field for any integral domain 3). 

Hint. Let «(x) € JZ)[x] have degree > 0 and assume fi(x) G JD[x\ a multiplicative inverse of a(x). 
Then a(x) • fi(x) has degree > 0, a contradiction. 

25. Factor each of the following into products of prime polynomials over (i) Q, (ii) R, (iii) C. 

(o) x* - 1 (c) 6x 4 + 5x3 + 4*2 _ 2x - 1 

(6) x 4 - 4x 2 - x + 2 (d) 4x 5 + 4x* - 13x* - llx 2 + lOx + 6 

Ana. (a) (x - l)(x + l)(x 2 + 1) over Q, R; (x - l)(x 4- l)(x - i)(x + i) over C 

(d) (x - l)(2x + l)(2x + 3)(x 2 - 2) over Q; (x - l)(2x + l)(2x + 3)(x -y[2)(x + yj2) over R, C 

26. Factor each of the following into products of prime polynomials over the indicated field. (See note 
in Problem 21.) 

(а) x 2 + 1; //(5) (c) 2x 2 + 2x + 1; 7/(5) 

(б) x 2 + x + 1; 7/(3) ( d) 3x3 + 4*2 + 3; 7/(5) 

Ana. (a) (x + 2)(x + 3), (d) (x + 2) 2 (3x + 2) 

27. Factor x 4 -1 over (o) 7/(11), (6) 7/(13). 

28. In (d) of Problem 26 obtain also 3x» + 4x 2 + 3 = (x + 2)(x + 4)(3x + 1). Explain why this does not 
contradict the Unique Factorization Theorem. 

29. In the polynomial ring S[x] over S, the ring of Example 1(d), Chapter 11, Page 114, 

(a) Show that bx 2 + ex + g and gx 2 + dx + 6 are prime polynomials. 

(b) Factor hx 4 + ex 3 + ex 2 + b. 

Ana. (6) (bx + b)(ex + g)(gx + d)(hx + e) 

30. Find all zeros over C of the polynomials of Problem 25. 

31. Find all zeros of the polynomials of Problem 26. Aws. (a) 2,3; (d) 1,3,3 

32. Find all zeros of the polynomial of Problem 29(6). Ana. b, e, /, d 

33. List all polynomials of the form 3x 2 + cx + 4 which are prime over 7/(5). 

Arts. 3x 2 + 4, 3x 2 + x + 4, 3x 2 + 4x + 4 

34. List all polynomials of degree 4 which are prime over 7/(2). 

35. Prove: Theorems VII, IX, and XIII. 

36. Prove: If o + byfc, with a,b,c G Q and c not a perfect square, is a zero of «(x) e 7[x], so 






142 


POLYNOMIALS 


[CHAP. 12 


37. Let % be a commutative ring with unity. Show that %[x] is a principal ideal ring. What are its 
prime ideals? 

38. Form polynomials a(x) G J[x] of least degree having among its zeros: 

(а) a/ 3 and 2 (e) 1 and 2 + 3^ (e) 1 + t of multiplicity 2 

(б) i and 3 (d) -1 + i and 2 - 3i 

Ana. (a) * 3 - 2* 2 - 3x + 6, (d) x* - 2x 3 + lx 2 + 18* + 26 

39. Verify that the minimum polynomial of y/S + 2 i over R is of degree 2 and over Q is of degree 4. 

40. Find the greatest common divisor of each pair a(x), p(x) over the indicated coefficient ring and express 
it in the form a(x) • a(x) + t(x ) • p(x). 

(a) *s + * 4 - * 3 - 3* + 2, *=> + 2* 2 - * - 2 ; Q 

(b) Zx* - 6*3 + 12* 2 + 8* - 6, *3 - 3*2 + 6* - 3 ; Q 

(c) x s - 3i*3 - 2i*3 - 6, *2 - 2i ; C 

(d) * 5 + 3*3 + *2 + 2* + 2, *“ + 3*3 + 3*2 + * + 2 ; 1/(5) 

(e) *5 + *» + *, x* + 2*3 + 2*; //(3) 

(/) ex 4 + hx 3 + a* 2 + gx + e, gx a + hx 2 + dx + g ; S of Example 1(d), Chapter 11, Page 114. 

Ana. ( b) (37*2-108* +177) •«(*) + ^ (-111** + 213* 2 - 318* - 503) • p(x) 

(d) (* +2) •<*(*) + (4*2 + * + 2)-j3(*) 

(/) (gx + h) • a(x) + (ex 2 + hx + e) • p(x) 

41. Prove Theorem XVII, Page 132. 

42. Show that every polynomial of degree 2 in y[*) of Example 11, Page 133, factors completely in 
f[x]/(x 2 + 1) by exhibiting the factors. 

Partial Ans. x 2 + 1 = (* +1)(* + 2{); ** + * + 2 = (* + { + 2)(# + 2f + 2); 

2*2 + * + l = (* + { + l)(2* + { + 2) 

43. Discuss the field Q[*]/(* 3 — 3). (See Example 10, Page 133.) 

44. Find the multiplicative inverse of J 2 + 2 in (o) <?[*]/(** + * + 2), (b) f[x]/(x 2 + * + 1) when f = 1/(5). 
Ana. (a) £(l-2|-f2), (6) J(f + 2) 



Chapter 13 


Vector Spaces 


INTRODUCTION 

In this chapter we shall define and study a type 
of algebraic system called a vector space. Before 
making a formal definition, we recall that in elemen¬ 
tary physics one deals with two types of quantities: 

(a) scalars (time, temperature, speed) which have 
magnitude only and (6) vectors (force, velocity, 
acceleration) which have both magnitude and direc¬ 
tion. Such vectors are frequently represented by 
arrows. For example, consider in Fig. 13-1 a given 
plane in which a rectangular coordinate system has 
been established and a vector £, = OP = (a, 6 ) join¬ 
ing the origin to the point P(a, b). The magnitude 
of (length of OP) is given by r = \/a 2 + b 2 and 
the direction (the angle 6, always measured from the 
positive x-axis) is determined by any two of the Fig. 13-1 

relations sin 6 = b/r, cos d = a/r, tan 6 = 6/a. 

Two operations are defined on these vectors: 

Scalar Multiplication. Let the vector f a = (a, 6) represent a force at 0. The product 
of the scalar 3 and the vector f t defined by 3f ( = (3a, 36) represents a force at O having the 
direction of f, and three times its magnitude. Similarly, — 2f, represents a force at O having 
twice the magnitude of f, but with direction opposite that of f. 

Vector Addition. If = (a, 6) and f 2 = (c, d) represent two forces at O, their result¬ 
ant f (the single force at 0 having the same effect as the two forces f, and f 2 ) is given by 
^ = = (ffl + c, 6 + d) obtained by means of the parallelogram law. 

In the example above it is evident that every scalar s G R and every vector (efixfi. 
There can be no confusion then in using (+) to denote addition of vectors as well as addition 
of scalars. 

Denote by V. the set of all vectors in the plane, (i.e., V = R x R). Now V has a zero 
element f = (0,0) and every £ = (a, 6) G V has an additive inverse —£ = (—a, —6) G V 
such that f + (—f) = £; in fact, V is an abelian additive group. Moreover, for all s,t G R 
and £,t) G V, the following properties hold: 

»(f + r,) = Sf + Sr, (s + t)i = S| + t£ 

s(t£) = (s<){ If = { 

Example 1: Consider the vectors { = (1,2), * = (£, 0), a — (0, -3/2). Then 

(а) 3£ = 3(1,2) = (3,6), 2, = (1,0), and 3{ + 2, = (3,6) + (1,0) = (4,6). 

(б) f + 2, = (2,2), , + , = (f-3/2), and 5(£ + 2,) - 4(, + a) = (8,16). 
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VECTOR SPACES 

Let f be a field and V be an abelian additive group such that there is a scalar multipli¬ 
cation of V by f which associates with each sGf and 4 G V the element s£ G V. Then 
V is called a vector space over f provided, with u the unity of f\ 

(i) s(4 + v) = s£ + s v (iii) s(t£) = ( st)£ 

(ii) (s + 1)£ = s£ + t£ (iv) u£ = £ 

hold for all s,tGf and £,rjGV. 

It is evident that, in fashioning the definition of a vector space, the set of all plane 
vectors of the section above was used as a guide. However, as will be seen from the 
examples below, the elements of a vector space, i.e. the vectors, are not necessarily quantities 
which can be represented by arrows. 

Example 2: (a) Let f = R and V = V 2 (R) - {(a„ o,s) : o„ a? G R} with addition and scalar 

multiplication defined as in the first section. Then, of course, V is a vector 
space over f\ in fact, we list the example in order to point out a simple generali¬ 
zation: Let T = R and V = V n (R) = {(a u a 2 , ...,a n ): a, G R} with addition 
and scalar multiplication defined by 

£ + H = («1, ..«») + (*1> h . b rt) 

= (ttj + 6j, Oi + &2> ...,<*»+fc»), {, v € V n (R) 

and 

s( = *(®i. «2> • ■ •. ®n) - (*°1> set 2> • • •» 8 °n)» «6JJ, (6V n (B) 

Then V n (R) is a vector space over R. 

(6) Let f = R and V„ (O = {(a t , <h, ..., a„): S C} with addition and scalar 

multiplication as in (a). Then V n (C) is a vector space over R. 

(e) Let T be any field, V = f [»] be the polynomial domain in * over f, and define 
addition and scalar multiplication as ordinary addition and multiplication in 
y[*]. Then V is a vector space over f. 

Let f be a field with zero element z and V be a vector space over f. Since V is an 
abelian additive group, it has a unique zero element £ and, for each element £ € V, there 
exists a unique additive inverse —£ such that £ + (—£) = £• By means of the distributive 
laws (i) and (ii), we find for all s Gf and £ G V, 

s£ + z£ = (8 + z)£ = s£ = s£ + £ 
and s£ + s£ = s(4 + £) = s£ = s£ + £ 

Hence, z£ = £ and s£ = £. 

We state these properties, together with others which will be left for the reader to estab¬ 
lish, as 

Theorem I. In a vector space V over f with z the zero element of f and £ the zero element 

of V, we have 

(1) s£ = £ for all s € f 

(2) z£ = £ for all £ G V 

(3) (-s)l = s(-4) = -(*4) for all sGf and (G7 

(4) If s£ = £, then s = z or £ = £ 

SUBSPACE OF A VECTOR SPACE 

A non-empty subset U of a vector space V over f is a subspace of V provided U is itself 
a vector space over f with respect to the operations defined on V. There follows 
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Theorem II. 


A non-empty subset U of a vector space V over f is a subspace of V if and 
only if U is closed with respect to scalar multiplication and vector addition 


as defined on V. 


For a proof, see Problem 1. 


Example 3: Consider the vector space V = V 3 (R) = {(a, b, e ): a, b, e G R} over R. By Theorem 
II, the subset U = {(a, b, 0): a,b G R} is a subspace of V since for all and 

(o, 6,0), (c, d, 0) G U, we have 

(o, b, 0) + (c, d, 0) = (a + e,b + d, 0) G V 




s(a, b, 0) = (sa, sb, 0) G 17 


In Example 3, V is the set of all vectors in ordinary space while U is the set of all such 
vectors in the AOF-plane. Similarly, W = {(a, 0,0) : a £ R} is the set of all vectors along 
the X-axis. Clearly, IF is a subspace of both U and V. 

Let l 1( i 2 , .. .,! m £ V, a vector space over f. By a linear combination of these m vec¬ 
tors is meant the vector | £ V given by 

£ = 2 + c 2 | 2 + ••• + c m ! m , c.£f 

Consider now two such linear combinations 2 c,£ t and Since 

2«A + 2<*A = 2 («,+«*& 

and, for any sGf, *2 c£ = 2 (sc.)^ 

we have, by Theorem II, 

Theorem III. The set U of all linear combinations of an arbitrary set S of vectors of a 
(vector) space V is a subspace of V. 

The subspace U of V defined in Theorem III is said to be spanned by S. In turn, the 
vectors of S are called generators of the space U. 

Example 4: Consider the space V 3 (R) of Example 3 and the subspaces 
17 = {«(1,2,1) + t(3,1,5): s, t G R} 

spanned by = (1,2,1) and ? 2 = (3,1,5) 

and W = {a(l, 2,1) + 6(3,1,6) + c(3, -4,7) : a, 6, c G R} 

spanned by {„ { 2 . and ? 3 = (3, —4,7). 

We now assert that 17 and W are identical subspaces of V. For, since 
(3, —4, 7) = —3(1,2,1) + 2(3,1,5), we may write 

W = {(o - 3e)(l, 2,1) + (6 + 2c)(3,1,5) : a, b, e G R} 

= {8'(1,2,1) + t'(3,1,5) : s', t’ G R} 

= 17 


Let t/ = + fe 2 | 2 + • • • + : k { £ f) 

be the space spanned by S' = {^,| 2 , .. a subset of vectors of V over f. Now U con¬ 

tains the zero vector £ £ V (why?); hence, if £ G S, it may be excluded from S leaving 
a proper subset which also spans U. Moreover, as Example 4 indicates, if some one of the 
vectors, say £ jt of S can be written as a linear combination of other vectors of S, then | may 
also be excluded from S and the remaining vectors will again span U. This raises questions 
concerning the minimum number of vectors necessary to span a given space U and the 
characteristic property of such a set. 

See also Problem 2. 
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LINEAR DEPENDENCE 

A non-empty set S = {£,, | 2 .of vectors of a vector space V over f is called 

linearly dependent over f if and only if there exist elements k v k 2 , . . .,k m of f, not all 
equal to z, such that 

2 Hi = Hx + Kk + • • • + k J m = t 

A non-empty set S = {£,, £ 2 , . .. , | m } of vectors of V over f is called linearly independent 
over T if and only if 

2 Hi = Hi + Hi + • " + k m£m - £ 

implies every k t = z. 

Note. Once the field f is fixed, we shall omit thereafter the phrase “over f"\ moreover, 
by “the vector space V n (Q)’’ we shall mean the vector space V n (Q) over Q, and similarly 
for V n (R). Also, when the field is Q or R, we shall denote the zero vector by 0. Although 

this overworks 0, it will always be clear from the context whether an element of the field 

or a vector of the space is intended. 

Example 5: (a) The vectors f t = (1,2,1) and f 2 = (3,l,5) of Example 4 are linearly inde¬ 

pendent, since if 

*,£, + fc 2 £ 2 = ( fc i + 3fc 2> 2fc i + fc 2- fc i + 5fc 2) = 0 = (0,0,0) 

then fc, + 3fc 2 = 0, + k 2 = 0, k x + 5fc 2 = 0. Solving the first relation for 

fcj = —3fc 2 and substituting in the second, we find —5fc 2 = 0; then fc 2 = 0 and 
k t = —3 k 2 = 0. 

(6) The vectors {j = (1,2,1), { 2 = (3,1,5) and f 3 = (3, -4,7) are linearly dependent, 
since 3{ t — 2& + { 3 = 0. 

There follow 

Theorem IV. If some one of the set S = {!,, | 2 . i m ) of vectors in V over f is the zero 

vector £, then necessarily S is a linearly dependent set. 

Theorem V. The set of non-zero vectors S = {|,, i 2 , .... ! m } of V over f is linearly 
dependent if and only if some one of them, say can be expressed as a 
linear combination of the vectors which precede it. 

For a proof, see Problem 3. 

Theorem VI. Any non-empty subset of a linearly independent set of vectors is linearly 
independent. 

Theorem VII. Any finite set S of vectors, not all the zero vector, contains a linearly inde¬ 
pendent subset U which spans the same vector space as S. 

For a proof, see Problem 4. 

Example 6: In the set S = {«».&.&} of Example 5(6), and are linearly independent while 
{3 = 2£ 2 — 3( t . Thus, T j = {£ 1( { 2 } is a maximum linearly independent subset of S. 
But, since £1 and £ 3 are linearly independent (prove this) while £ 2 = ^(£3 + 3£ t ), 
T 2 = {£i, £ 3} is also a maximum linearly independent subset of S. Similarly, 
T 3 = (£ 2 , £ s ) is another. By Theorem VII each of the subsets T u T 2 , T 3 spans the 
same space as does S. 

The problem of determining whether a given set of vectors is linearly dependent or 
linearly independent (and, if linearly dependent, of selecting a maximum subset of linearly 
independent vectors) involves at most the study of certain systems of linear equations. 
While such investigations are not difficult, they can be extremely tedious. We shall post¬ 
pone most of these problems until Chapter 14 where a neater procedure will be devised. 

See Problem 5. 
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BASES OF A VECTOR SPACE 

A set S = {£,, i 2 , of vectors of a vector space V over f is called a basis of V 

provided: 

(i) S is a linearly independent set, 

(ii) the vectors of S span V. 

Define the unit vectors of V n (f) as follows: 

c, = (tt, 0,0,0, . ..,0,0) 

t 2 = (0, «, 0, 0, ..., 0, 0) 

< 3 = (0,0,«,0, ...,0,0) 

« b = (0,0, 0,0, ...,0,tt) 

and consider the linear combination 

£ = «,*, + a 2<2 + • • • + o„c n = (otj, a 2 .a n ), a { e f (1) 

If £ = L then a t — a 2 = • • ■ — On = z; hence, E = {«,, .. .,e„} is a linearly independent set. 

Also, if £ is an arbitrary vector of V n (f), then ( 1) exhibits it as a linear combination of the 

unit vectors. Thus the set E spans V n (f) and is a basis. 

Example 7: One basis ot V 4 (ft) is the unit basis 

E = {(1,0,0,0), (0,1,0,0), (0,0,1,0), (0,0,0,1)} 

Another basis is 

F = {(1,1,1,0), (0,1,1,1), (1,0,1,1), (1,1,0,1)} 

To prove this, consider the linear combination 

{ = «,(1,1,1,0)+ 02(0,1,1,1) + Oj(l, 0,1,1)+ o 4 (l, 1,0,1) 

= («i + ®3 + «» 4 , <*j + <* 2 + «4> + «2 + a 3 , a 2 + o 3 + o 4 ); a { G R 

If | is an arbitrary vector (p, q, r, s) G V < (R), we find 

«i = (p + q + r - 2s)/3 a 3 = (p + r + s-2g)/3 

a 2 = (<f + r + s — 2p)/3 a 4 = (p + q + s — 2r)/3 

Then F is a linearly independent set (prove this) and spans V 4 (R). 

In Problem 6, we prove 

Theorem VIII. If S = {£,, | 2 , .... | m } is a basis of the vector space V over f and 
T = {^j, v 2 > • • ->V n ) is any linearly independent set of vectors of V, then 


As consequences, we have 


Theorem IX. 
and 


If S — {!,, $ 2 , ..., | m } is a basis of V over f, then any m +1 vectors of V 
necessarily form a linearly dependent set. 


Theorem X. Every basis of a vector space V over f has the same number of elements. 

The number defined in Theorem X is called the dimension of V. It is evident that 
dimension, as defined here, implies finite dimension. Not every vector space has finite 
dimension, as shown in 

Example 8: (a) From Example 7, it follows that V 4 (R) has dimension 4. 

(6) Consider V = {a 0 + a,* + a 2 x* + a 3 x 3 + a 4 x* : a s G R} 

Clearly, B = {1, x, x 2 , x 3 , * 4 } is a basis and V has dimension 5. 


(c) The vector space V of all polynomials in x over R has no finite basis and, hence, 
is without dimension. For, assume B, consisting of p linearly independent 
polynomials of V with degrees - q, to be a basis. Since no polynomial of V of 
degree > q can be generated by B, it is not a basis. g eg p ro y em 7 
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SUBSPACES OF A VECTOR SPACE 

Let V, of dimension n, be a vector space over f and U, of dimension m <n having 
B = {|j,| 2 , ..., | m } as a basis, be a subspace of V. By Theorem VIII, only m of the unit 
vectors of V can be written as linear combinations of the elements of B; hence there exist 
vectors of V which are not in U. Let be such a vector and consider 

k A + *£+"•+ k J m + k V, = (, h, k € f (2) 

Now k = z, since otherwise k~ l G f', 

Vl = fc- l (-*A- fc A- 

and 7 /j G U, contrary to the definition of -q v With k — z, (2) requires each h = z since 
B is a basis, and we have proved 

Theorem XI. If B = {£,,£ z ,.. .,| m } is a basis of U CV and if but *?, & U, then 

B u { 17 J is a linearly independent set. 

When, in Theorem XI, m + 1 = n, the dimension of V, Bi = B U { 77 ,} is a basis of V; 
when m + 1 <n, Bi is a basis of some subspace lh of V. In the latter case there exists 
a vector i ? 2 in V but not in Ut such that the space TJi, having B U {i/j, »? 2 } as basis, is either V 
or is properly contained in V, ... . Thus we eventually obtain 

Theorem XII. If B = {£,, | 2 , ..., | m } is a basis of U CV having dimension n, there exist 
vectors -q v i? 2 , ..., ij n _ m in V such that B U {ij v r/ 2 , ..., v n -m) a basis of V. 

See Problem 8 . 

Let U and W be subspaces of V. We define 

UC\ W = {£: £ <= U, W) 

U+W = {Z + v : £GU, v GW} 
and leave for the reader to prove that each is a subspace of V. 

Example 9: Consider V - V 4 (R) over R with unit vectors «i,e 2 ,e 3 ,« 4 as in Example 7. Let 
U = (Ojei + « 2 e 2 + °3 e 3 • * 1 ^®} 
and W — ( 6 i «2 + & 2«3 + & 3 e 4 : € R} 

be subspaces of dimensions 3 of V. Clearly, 

U n W = 2 + c 2 e 3 : G R} , of dimension 2 

and U+W = {«!«! + « 2 f 2 + ® 3 e 3 + ** 1*2 + ^2 e 3 + ** 3 e 4 : °i> &i S B} 

— + d 2 (2 + d 3 e 3 + d 4 e 4 : d t € R} = V 


Example 9 illustrates 

Theorem XIII. If U and W, of dimension r^n and s^n respectively, are subspaces of a 
vector space V of dimension « and if U n W and U+W are of dimensions 
V and t respectively, then t = r + < - p. For a proof _ see Problem 9 . 


VECTOR SPACES OVER R 

In this section, we shall restrict our attention to vector spaces V= V n (R) over R. 
This is done for two reasons: (1) our study will have applications in geometry and (2) of all 
possible fields, R will present a minimum in difficulties. 
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Consider in V = V2 (R) the vectors £ = (ai, a 2 ) 
and v = (61,62) of Fig. 13-2. The length ||| of £ is 
given by HI = v/of-Taf and the length of 77 is given 
by |»?| = \/&i+ b|. By the law of cosines we have 
H - v \ 2 = III 2 + hi 2 - 2 If I -1^| cos 6 

so that 

cos* = («»+ + (b 1 + bl) - [(ai - ft,)* + (a, - 6 ,)*] 

2 III * hi 

_ ffll&l + tt2&2 

_ III‘hi 

The expression for cos * suggests the definition of 
the scalar product (also called dot product and inner 
product) of £ and 77 by Fig. 13-2 

| • 7 J = fll&l + 02^2 

Then ||| = yj£ • |, cos * = ^ j 7 y , and the vectors | and 77 are orthogonal (i.e., mutually per¬ 
pendicular, so that cos * = 0) if and only if £• 77 = 0. 

In the vector space V = V„ (R) we define for all £ = (a u a 2 ,...,On) and 77 = (b t , 62, ... , bn), 

I • 77 = 5) = ®i6i + ®2&2 + • • • + a„b„ 

III = -y/W = y/a\ + a\+ ••• +0* 

There follow 

( 1 ) |s|| = |»| • ||| for all | € V and sGR 

(2) HI — 0> the equality holding only when £ = 0 

( 3 ) ||*vl — HI'lvl (Schwarz inequality) 

( 4 ) || + ?l ~ HI + M (Triangle inequality) 

( 5 ) £ and 77 are orthogonal if and only if |* 77 = 0 . 

For a proof of ( 3 ), see Problem 10 . 

See also Problems 11 - 13 . 

Suppose in V n (R) the vector 77 is orthogonal to each of | lf | 2 , ..., | m . Then, since 
V * !i = V ‘ l 2 = ‘ ‘ ‘ = V' l m = 0 , we have 77 • (c,^ + c 2 | 2 4 - • • • + c m | m ) = 0 for any c t GR and 
have proved 

Theorem XIV. If, in V n (R), a vector 77 is orthogonal to each vector of the set {£,, £ 2 ,..£ m ), 
then 77 is orthogonal to every vector of the space spanned by this set. 

See Problem 14 . 

LINEAR TRANSFORMATIONS 

A linear transformation of a vector space V(f) into a vector space W{f) over the same 
field f is a mapping T of V{f) into W(f) for which 

(i) (| ; + £ } )T = £.T + £.T for all !,, f, G V(f) 

(ii) (s£.)T = s(f.T) for all £ t G V(f) and sGf 

We shall restrict our attention here to the case W(f) = V{f), i.e. to the case when T 
is a mapping of V{f) into itself. Since the mapping preserves the operations of vector 





150 


VECTOR SPACES 


[CHAP. 13 


addition and scalar multiplication, a linear transformation of V(f) into itself is either an 
isomorphism of V(f') onto V(f) or a homomorphism of V(f) into V(f). 

Example 10: In plane analytic geometry the familiar rotation of axes through an angle a is a 
linear transformation 

T : ( x , y) -» (x cos a - y sin a, x sin a + y cos a) 

of V 2 (R) into itself. Since distinct elements of V t (R) have distinct images and 
every element is an image (prove this), T is an example of an isomorphism of V(R) 
onto itself. 

Example 11: In V a (Q), consider the mapping 

T : (o, 6, c) -» (a + b + 5c, a + 2c; 26 + 6c), (a, 6, e) G V a (Q) 

For (o ,b,c),(d,e,f)eV 3 (Q) and s e Q, we have readily 

(i) (o, 6, c) + (d, e, f) = (a + d, b + e, c + f) 

(o + d + 6 + e + 5c + 5/, o + d + 2c + 2/, 26 4-2e + 6c + 6/) 

= (a + 6 + 6c, a + 2c, 26 + 6c) + (d + e + 5/, d + 2f, 2e + 6/) 
i.e., [(«, 6, c) + (d, e,f)\T = (a, b,e)T + (d, e, f)T 

and 

(ii) s(a, 6, e) = (sa, sb, sc) -» (so + s6 + 6sc, so + 2sc, 2s6 + 6sc) 

= s(a + 6 + 6c, o + 2c, 26 + 6c) 
i.e., [s(o,6,c)]r = s[(a,b,c)T] 

Thus T is a linear transformation on V 3 (Q). 

Since (0,0,1) and (2,3,0) have the same image (6,2,6), this linear transforma¬ 
tion is an example of a homomorphism of V S (Q) into itself. 


The linear transformation T of Example 10 may be written as 

x(l, 0 ) + 1 /( 0 , 1 ) *(cos «, sin «) + y(- sin a, cos a) 

suggesting that T may be given as 

T: (1,0) -» (cos a, sin «), (0,1) -» (- sin o, cos a) 

Likewise, T of Example 11 may be written as 

o(l, 0,0) + 6(0,1,0) + c(0,0,1) -> o(l, 1,0) + 6(1,0,2) 4 - c(5,2,6) 

suggesting that T may be given as 

T : (1,0,0) -*• (1,1,0), (0,1,0) (1,0,2), (0,0,1) -*• (5,2,6) 


Thus we infer: 

Any linear transformation of a vector space into itself can be described completely 
by exhibiting its effect on the unit basis vectors of the space. gee p rob j em 15 


In Problem 16, we prove the more general 

Theorem XV. If {| 1( i 2 , ..., !„} is any basis of V = V{f) and if { Vl , y v ■ ■ ■, v n ) is any 
set of n elements of V, the mapping 

T: (* = 1 , 2 ,...,«) 

defines a linear transformation of V into itself. 



CHAP. 13] 


VECTOR SPACES 


151 


In Problem 17, we prove 

Theorem XVI. If T is a linear transformation of V(f) into itself and if W is a subspace 
of V(f ), then W T = {£T : £ G W}, the image of W under T, is also a sub¬ 
space of V(f'). 

Returning now to Example 11, we note that the images of the unit basis vectors of V 3 ( Q) 
are linearly dependent, i.e., 2 t} T + 3< 2 T - < 3 T = (0,0,0). Thus, V T C V; in fact, since 
(1,1,0) and (1,0,2) are linearly independent, V T has dimension 2. Defining the rank of a 
linear transformation T of a vector space V to be the dimension of the image space V T of V 
under T, we have by Theorem XVI, 

r T = rank of T — dimension of V 

When the equality holds, we shall call the linear transformation T non-singular ; otherwise 
we shall call T singular. Thus T of Example 11 is singular of rank 2. 

Consider next the linear transformation T of Theorem XV and suppose T is singular. 
Since the image vectors v . are then linearly dependent, there exist elements s t G f, not 
all z, such that 2) s iVi = £• Then, for £ = 2 s j£ i( we have £T = f. Conversely, suppose 
7=2 * C and 

vT = 2 (t&)T = tjdjT) + t 2 (| 2 T) + • • • + t n (| n T) = f 
Then the image vectors $.T, (i =1,2,...,«) must be linearly dependent. We have proved 
Theorem XVII. A linear transformation T of a vector space V{f) is singular if and only if 
there exists a non-zero vector | G V(f) such that |T = 

Example 12: Determine whether each of the following linear transformations of V 4 ( Q ) into itself 
is singular or non-singular: 

{ «, - (i,i, o,o) r«j - a,i, o,o) 

^ (0,1,1,0) u - (0,1,1,0) 

e 3 - (0,0,1,!)' (6) ,3^(0,0,1,1) 

u -» ( 0 , 1 , 0 , 1 ) [, 4 -♦ ( 1 , 1 , 1 , 1 ) 

Let { = (o, b,e,d) be an arbitrary vector of V 4 (Q). 

(a) Set |A — (a«i + 6e 2 + c«3 + de 4 )A = (a, a-\-b-\-d, b-\-c, c + d) — 0. Since this re¬ 
quires a — b = e = d = 0, that is, { = 0, A is non-singular. 

(b) Set $B = (o -I- d, a + b + d, b + e + d, c + d) = 0. Since this is satisfied when 

a = e = l, b = 0, d = —1, we have (1,0,1, — 1)B = 0 and B is singular. This 

is evident by inspection, i.e., tl B + e 3 B = e 4 B. Then, since (l B, e 2 B, € 3 B are 
clearly linearly independent, B has rank 3 and is singular. 

We shall again (see the paragraph following Example 6) postpone additional examples 
and problems until Chapter 14. 


THE ALGEBRA OF LINEAR TRANSFORMATIONS 

Denote by oA the set of all linear transformations of a given vector space V{f) over 
T i n to itself and by CM the set of all non-singular linear transformations in cA. Let addition 
(+) and multiplication (•) on cA be defined by 

A+B : £(A + B) = £A + £B, | G V(f) 

and A • B : £(A • B) = (£A)B, £ G V{f) 

for all A,B GcA. Let scalar multiplication be defined on cA by 
kA : i(kA) = (k£)A, £ G V(f) 


for all A G aA and kGf. 
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Example 13: Let 


► (o,6) 

► (c,d) 


and 


► («,/) 

> (g,h) 


be linear transformations of V 2 (R) into itself. For any vector { — (s, t) S V 2 (R), 
we find 

lA = (a,t)A = («€, + U 2 )A = 8(o, b) + t(c,d) 

= (so + tc, s6 + td) 

= ( se + tg, sf + th) 

j(A +B) = (s, t)A + (s, t)B = (s(o + e) + t(e + g), s(b + f) + t(d + h )) 


Thus we have 


A + B : 


J«j -* (o 4- e, b + f) 
[_ e 2 -* (c + g,d+ h) 


Also, 

{(A • B) = ((8, t)A)B = (so + te, sb + td)B 
= (so + te) •(«,/) + (*6 + td) • (g, h) 

= (s(oe + bg) + t(ee + dg), s(o/ + 66) + t(ef + dh)^ 

ftj -* (oe + 6p, o/ + 66) 

A [« 2 -* (ee + dg, cf + dh) 


Finally, for any k € R, we find 

(k£)A = ( ks,kt)A = ( k(sa + tc ), k(sb + td)) 


and 



(, ka , *5) 
(fee, kd) 


In Problem 18, we prove 

Theorem XVIII. The set q4 of all linear transformations of a vector space into itself 
forms a ring with respect to addition and multiplication as defined 
above. 


In Problem 19, we prove 

Theorem XIX. The set (M of all non-singular linear transformations of a vector space 
into itself forms a group under multiplication. 


We leave for the reader to prove 

Theorem XX. If cA is the set of all linear transformations of a vector space V{f) over 
f into itself, then cA itself is also a vector space over f. 

Let A,B G cA. Since, for every £ € F, 

i(A + B) = £A + £B 

it is evident that F <A+B) C V A + V B 

Then dimension of V (A+B) — dimension of V A + dimension of V B 


and 


r A + r i 
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Since for any linear transformation T G cA, 

dimension of V T — dimension of V 
we have dimension of F (A . B) — dimension of V A 

Also, since V A C V, dimension of F (A . B) — dimension of V B 
Thus r (A>B) ^ r A , r (A . B) ^ r B 


Solved Problems 

1. Prove: A non-empty subset U of a vector space V over f is a subspace of V if and 
only if U is closed with respect to scalar multiplication and vector addition as defined 
on V. 

Suppose U is a subspace of V; then U is closed with respect to scalar multiplication and vector 
addition. Conversely, suppose U is a non-empty subset of V which is closed with respect to scalar 
multiplication and vector addition. Let { G U; then (-«)€ = ~(u() = -{£[/ and ( + (-£) = f e U . 
Thus U is an abelian additive group. Since the properties (i)-(iv). Page 144, hold in V, they also hold 
in V. Thus V is a vector space over f and, hence, is a subspace of V. 


2. In the vector space V s (R) over R (Example 3, Page 145), let U be spanned by 
I, = (1,2, -1) and % 2 = (2, -3,2) and W be spanned by | 3 = (4,1,3) and | 4 = (-3,1,2). 
Are U and W identical subspaces of F? 

First, consider the vector { = { 3 - f 4 = (7,0,1) £ W and the vector 

v = xii + Vil = (* + 2 y, 2x - 3 y, -x + 2y) G U 
Now i and y are the same provided x,y G R exist such that 


(* + 2 y, 2x - 3 y, -x + 2y) = (7, 0, 1) 


We find x = 3, y = 2. To be su 
to produce x,y G R such that 

for arbitrary a,b G R. 


s, this does not prove V and W identical; for that we need to be able 
+ Viz = + H* 


[ x + 2y = 4a — 36 

From {-* + 23/ = 3a + 26 W ® find * = i (a ~ 5b) ’ V = Now 2x - 3y # a + 6; 

hence U and W are not identical. 

Geometrically, U and W are distinct planes through O, the origin of coordinates, in ordinary space. 
They have, of course, a line of vectors in common; one of these common vectors is (7,0,1). 


3. Prove: The set of non-zero vectors S = {| 1( | 2 , ..., f m ) of V over f is linearly dependent 
if and only if some one of them, say can be expressed as a linear combination of the 
vectors £,,£ 2 , . . .,^_ 1 which precede it . 

Suppose the m vectors are linearly dependent so that there exist scalars k u k 2 , k m , not all 
equal to z, such that 2 &i£> = £■ Suppose further that the coefficient kj is not z while the coefficients 
+ 1 > k j+2 . k m are z (but not excluding, of course, the extreme case j = m). Then in effect 

fcifi + k 2 Z 2 + • • • + fejlj = f 


( 1 ) 
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and, since kfc # S, we have 

Hi = -fciii - Hi -- 

or ij = 9i4i + 9242 + ••' + 9j-i4j-i ( 2 ) 

with some of the q { ¥= z. Thus, £j is a linear combination of the vectors which precede it. 
Conversely, suppose (2) holds. Then 

fcifi + fc 2 £ 2 4- • • • + kfa + *4j+i + *4 j+ 2 + ••• + *4m = f 

with kj¥= z and the vectors £j,£ 2 ,.. .,( m are linearly dependent. 


4. Prove: Any finite set S of vectors, not all the zero vector, contains a linearly independ¬ 
ent subset U which spans the same space as S. 

Let S = {£„ £ 2 , ..., £ m }. The discussion thus far indicates that U exists while Theorem V suggests 
the following procedure for extracting it from S. Considering each vector in turn from left to right, 
let us agree to exclude the vector in question if (1) it is the zero vector or (2) it can be written as 
a linear combination of all the vectors which precede it. Suppose there are n -m vectors remaining 
which, after relabeling, we denote by U = {v i, vs>.. •»*«}• By its construction, U is a linearly 
independent subset of S which spans the same space as S. 

Example 6, Page 146, shows, as might have been anticipated, that there will usually be linearly 
independent subsets of S, other than V, which will span the same space as S. An advantage of the 
procedure used above lies in the fact that, once the elements of S have been set down in some order, 
only one linearly independent subset, namely U, can be found. 


5 . Find a linearly independent subset U of the set S — {l x , l 2 , l 3 > l 4 }> where 
= (1,2, -1), | 2 = (-3, -6, 3), | 3 = (2,1,3), | 4 = (8,7,7) G R, 
which spans the same space as S. 

First, we note that U * J and move to | 2 . Since £ 2 = -3{ lf (i.e. { 2 is a linear combination of £ t ), 
we exclude { 2 and move to £ 8 . Now £ a # *4i, for any s£K; we move to £ 4 . Since £ 4 is neither a scalar 
multiple of £ 4 nor of £ 3 (and, hence, is not automatically excluded), we write 

8 {j + t| 3 = s(l, 2, —1) + t(2,1,3) = (8,7,7) = | 4 
and seek a solution, if any, in R of the system 

g + 2t = 8, 2s + t = 7, -8 + 3t = 7 

The reader will verify that £ 4 = 2{ 4 + 3| 3 ; hence, U = {£ 1( £ 3 } is the required subset. 


6 . Prove: If S = {£ 1( £ 2 , ..., £ m } is a basis of a vector space V over f and if 
T = {? <i t , rf 2 , ..., r) n } is a linearly independent set of vectors of V, then n — m. 

Since each element of T can be written as a linear combination of the basis elements, the set 

S' = {vi, £i> £2* • • • > 4m) 


spans V and is linearly dependent. Now vi * S', hence some one of the £'s must be a 
tion of the elements which precede it in S'. Examining the £’s in turn, suppose we 
this condition. Excluding £ 4 from S', we have the set 


linear combina- 
find £j satisfies 


S t = Ui, 4i> 4 2 . • • •* 4i i> 4i+i.4m> 

which we shall now prove to be a basis of V. It is clear that spans the same space as S', i.e. S x 
spans V. Thus we need only to show that S x is a linearly independent set. Write 

vt = ®i4i + ®242 + * - ’ + ®»4i> “j e f> a i ¥ ‘ z 
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If S, were a linearly dependent set there would be some (j, j > i, which could be expressed as 
£j = 6 i>7i + &2ii + Hii + ••' + ftjfi-i + H+1U+1 + ••• + 1 > &i v 4 * 

whence, upon substituting for i/ 1 , 

£j = «l£l + C 2i2 + *•• + C,-l ij-l 

contrary to the assumption that S is a linearly independent set. 


Similarly, S[ = (, 2 , {„ £ 2 , ..., £;_„ £ <+1 , ..., £ m } 

is a linearly dependent set which spans V. Since and ij 2 are linearly independent, some one of the 
£’ s in Sj, say £ j( is a linear combination of all the vectors which precede it. Repeating the argument 
above, we obtain (assuming j > i) the set 


as a basis of V. 


S2 — {*2. *j» £i> £2. • • •» £<-i» £m» • • £j-i» £j+i» ..., £ m } 


Now this procedure may be repeated until T is exhausted provided n =£ m. Suppose n> m and 
we have obtained 

S m = (v m . • • •, 1 » 2 > 1 » 1 > 

as a basis of V. Consider S^ = Su{, m+1 }. Since S m is a basis of V and , mtl 6V, then 
ij m+1 is a linear combination of the vectors of S m . But this contradicts the assumption on T. Hence 
n — m, as required. 


(a) Select a basis of Vs ( R) from the set 

S = {«!, fp I 4 ) = {(1. -3.2), (2,4,1), (3,1,3), (1,1,1)} 

(5) Express each of the unit vectors of V 3 (R) as linear combinations of the basis vectors 
found in (a). 

(a) If the problem has a solution, some one of the {’s must be a linear combination of those which 
precede it. By inspection, we find { 3 = {i + £ 2 - To prove {{i,{ 2 ,£ 4 } a linearly independent set 
and, hence, the required basis, we need to show that 

<*£i + Hi + «U = (o + 26 + c, -3a + 46 + e, 2a + b + e) = (0, 0, 0) 
requires a = 6 = c = 0. We leave this for the reader. 

The same result is obtained by showing that 

*£l + ff2 = £4. s,t€R 

s + 2 1 = 1 

-3s + 4t = 1 is impossible. Finally any reader acquainted with determinants will recall 
2s + t = 1 


that these equations have a solution if and only if 


(6) Set a£ t + b( 2 + c{ 4 equal to the unit vectors « 1( e* e 3 
f a + 26 + c = 1 
■j —3a + 46 + c = 0 
[ 2a + 6 + c = 0 
having solutions: 

a = 3/2, 6 = 5/2, c - -11/2 a = 6 = -1/2, c = 3/2 
Thus *! = + 5| 2 - Ilk), *2 = i(- 


12 1 

-3 4 1 =0. 

2 1 1 j 

turn and obtain 
= 0 


6 + c = 0 


—3 a + 46 + c = 0 
2a + 6 + c = 1 


= -1, 6 = —2, c = 5 
£2 + Hi)* and e s = —£j — 2£ 2 + 5£ 4 . 


8 . For the vector space V*(Q) determine, if possible, a basis which includes the vectors 
I, = (3, -2,0,0) and | 2 = (0,1,0,1). 
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the set S = {£ 1; j 2 , «i, « 2 , e 3> *4) spans V 4 (Q) and surely contains a basis of the type we seek. Now e t 
will be an element of this basis if and ofily if 

oil -fc Hz + c*i = (3o + c, -2a + b, 0, 6) = (0, 0, 0, 0) 

requires a = b = e = 0. Clearly, c t can serve as an element of the basis. Again, e 2 will be an element 

if and only if 

ail + Hi + C'l + *2 = (3« + e, -2a + 6 + d, 0, 6) = (0, 0, 0, 0) (1) 

requires a = b = e-d-0. We have 6=0 = 3a + c = -2a + d; then ( 1) is satisfied by a = 1, 

6 = 0, c = -3, d = 2 and so {£1, | 2 , e„ *2) is not a basis. We leave for the reader to verify that 
{£1. £2. «i> *3} is a basis. 

9. Prove: If U and W, of dimensions r^n and s -» respectively, are subspaces of a 

vector space V of dimension n and if V n W and U + W are of dimensions p and t 

respectively, then t — r + s-p. 

Take A = {£„ £ 2 , ..., £ p ) as a basis of U fl W and, in agreement with Theorem XII, Page 148, 

take B = A U {x lf X 2 .X r _ p } as a basis of U and C = A V .. .,/x 3 - p } as a basis of W. 

Then, by definition, any vector of 174- W can be expressed as a linear combination of the vectors of 

D = {£1, £ 2 , ..., £ p , X lf X 2 , ..., X r _ p , iii, /t 2 , ..., lig-p) 

To show that D is a linearly independent set and, hence, is a basis of U 4- W, consider 

®iii + ® 2 £ 2 + * - - + a p £ p 4- b{Ki + 6 2 X 2 4- • " 4- 6 r _ p X r _ p 4- CjMi 4- e ^ 4- ■ ■ • 4- c s _ p /i s - p — f (7) 

where a ( ,bj,c k € f. 

Set r = Cini + c 2 /x 2 4- • • • 4- c,- p n,- p . Now r G W and by (7), v € U\ thus, r S U Cl W and 
is a linear combination of the vectors of A, say, v = d,£i 4- d 2 £ 2 4- • • • 4- d„£ p . Then 

Cini 4- c 2( u 2 4- • • • 4- c t _p/t s _p dj£j — d 2 £ 2 — • • • — dp£ p = I 
and, since C is a basis of W, each c, = z and each d 4 = 2. With each c 4 = z, (7) becomes 

aj£j 4- a 2 £ 2 4- • • • 4- a p £p 4- bjXj 4- & 2 X 2 4- • • • 4- 6 r _ p X r _ p = f (7') 

Since B is a basis of U, each a, = z and each 6 ; = z in (7')- Then D is a linearly independent set 
and, hence, is a basis of V + W of dimension t = r + 8 — p. 


10. Prove: |$ • i/| — |(| * \v\ 
For £ = 0 or * = 0, 1 
and we have 

and 0 


for all |,i/ G V n {R). 

e have 0-0. Suppose £ + 0 and n # 0; then |i?| = k |£| for 
|,|2 = [fc.|{|]2 = fc-ICl’M = fc 2 • |£l 2 = k 2 (t • £) 

^ (fc£±,).(fe£± ,) = fc 2 (£ • £) ± 2 fc(£ • ,) + 1 , • U 

= 2k • |£| • |n| ± 2fc(£ • 1 /) 


Hence 


± 2fc(£ • v) ~ 2fc|£|-M 
*<«•»> ^ |£| * |n| 


and 


l«*i| - ltl*W 


fcGB + , 


11. Given $ = (1,2,3,4) and v = (2,0, -3,1), find 
(a) |-i/, (&) |£| and |i/[, (e) |5$( and |-3i/|, ( d) |$ + »/| 

(a)£-u = 1-24-2-04- 3(—3) 4- 4-1 = -3 

(j b) |£| = Vl-1 + 2-24-3-34-4-4 = x/30, |ij| = x/4 + 9 + 1 = Vl4 

(c) |6£| = V25 + 25 • 4 4- 25 • 9 4- 25 • 16 = 5^30, |-3i/| = V9 • 4 4- 9 • 9 + 9 • 1 = 3^14 

(d) £4-r, = (3,2,0,5) and |£ + n| = V9 + 4 4- 25 = V38 
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12. Given £ = (1,1,1) and , = (3,4,5) in V a (R), 
find the shortest vector of the form a = £ + s,. 

Here, X = (1 + 3s, 1 + 4s, 1 + 5s) 
and |X|* = 3 + 24s + 50s 2 

Now |X| is minimum when 24 + 100s = 0. Hence, 
X = (7/25,1/25, —1/5). We find easily that X and y 
are orthogonal. We have thus solved the problem: 
Find the shortest distance from the point P(l, 1,1) 
in ordinary space to the line joining the origin and 
Q(3,4,5). 



Fig. 13-3 


V 


13. For £ = (ai,a 2 ,a 3 ), , = (b u b 2 , b 3 ) G V 3 (R), define 

£ x , = (a 2 & 3 — a 3 i> 2 , a 3 6 i — ai 6 3 , aib 2 — a 2 bi) 

(а) Show that £ x , is orthogonal to both £ and ,. 

(б) Find a vector A orthogonal to each of £ = (1,1, 1 ) and , = (1, 2, -3). 

(а) {’ (4 X ij) = a t (a 2 6 3 — a 3 6 2 ) + a 2 (a a b l — a t b 3 ) + a 3 (aj6 2 — a 2 6 x ) = 0 and similarly for v ■ (£ X *). 

(б) X = (X, = (l(—3) — 2*1, 1*1 — 1(—3), 1 • 2 — 1 • l) = (-5,4,1) 


14. Let £ — (1, 1 , 1 ,1) and , = (1,2, —3,0) be given vectors in V* (R). 

(а) Show that they are orthogonal. 

( б ) Find two linearly independent vectors A and M which are orthogonal to both £ and ,. 

(c) Find a non-zero vector v orthogonal to each of £, ,, A and show that it is a linear com¬ 
bination of A and p. 


( a ) t ' V — 1‘1 + 1'2-f 1(—3) — 0 ; thus t and y are orthogonal. 

(b) Assume (a,b,e,d) € V t (R) orthogonal to both £ and »; then 

(*) a + b + c + d = 0 and a + 2b - 3c = 0 


First, take c-0. Then a+26 = 0 is satisfied by a = 2, b = — 1; and a+6 + c + d = 0 now 
gives d = —1. We have X = (2,-1,0,-1). 

Next, take 6 = 0. Then a-3c = 0 is satisfied by a = 3, c = 1; and a+6 + c + d = 0 now 
gives d - -4. We have p = (3,0,1, -4). Clearly, x and p are linearly independent. 

Since an obvious solution of the equations (i) is a = b = e-d = 0, why not take X = (0,0,0,0)? 

(c) If r = (a, 6, c, d) is orthogonal to {, y and X, then 


a + 6 + c + d = 0, a + 26 — 3c = 0 and 2a - 6 - d = 0 


Adding the first and last equation, we have 3a + e = 0 which is satisfied by a = 1 c = -3 
Now 6 = —5, d = 7, and r = (1, —5, —3,7). Finally, » = 5X-3 p. 


Note. It should be clear that the solutions obtained here are not unique. First of all, any 
non-zero scalar multiple of any or all of X, p, v is also a solution. Moreover, in finding X (also, ’p) we 
arbitrarily chose c-0 (also, 6 = 0). However, examine the solution in (c) and verify that v is unique 
up to a scalar multiplier. 
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15. Find the image of | = (1,2,3,4) under the linear transformation 

{ £l -» (1,-2,0,4) 
e 2 - ( 2 , 4 , 1 ,- 2 ) 
e 3 - (0,-1, 5,-1) 
e 4 - (1,3,2,0) 

of Vi {Q) into itself. 

We have 

£ = £l + 2e 2 + 3e 3 + 4* 4 -> 

(1, -2,0,4) + 2(2,4,1, -2) + 3(0, -1,5, -1) + 4(1,3,2,0) = (9,15,25, -3) 


16. Prove: If {^,f 2 , ...,£„} is any basis of V = V(f) and if { VvV2 , is any set 

of n elements of V, the mapping 

T: «,->*, (i = 1,2, .. .,n) 
defines a linear transformation of V into itself. 

Let £ = 2 «i£i and n = 2 tj£i be any two vectors in V. Then 

£ + v = 2 («i + *j)£i -* 2(*i+ tihi = 2 s 0i + 2 Uvi 
so that (i) (£ + v)T = £T 4- ijT 

Also, for any jGf and any f£V, 

*£ = * 2 s i£i -» »2 

so that (ii) (#£)T = 8(£T) 

as required. 


17. Prove: If T is a linear transformation of V(f") into itself and if IF is a subspace of 
V(f), then W T = {|T: | G W}, the image of W under T, is also a subspace of V(f). 

For any £7\ r,T G W T , we have £7* + yT = (£ + v)T. Since £, * G W implies £ + n G W, then 
(£ + V )T G W T . Thus W T is closed under addition. Similarly, for any £T G W T , sSf, we have 
s(£T) = (8£)T G W t since £ G W implies «£ G W. Thus W r is closed under scalar multiplication. 
This completes the proof. 


18. Prove: The set cA of all linear transformations of a vector space V(f) into itself forms 
a ring with respect to addition and multiplication defined by 


A + B : t(A + B) = U 4-|£, 
A-B: t(A-B) = ($A)B, 

for all A,B G cA. 

Let £,ij G V(3 r ), fcGf, and A,B,CGcA. Then 

(£ + „)(A + R) = (£ + v)A + (£ + v)B = 

= i(A+B) + v(A+B) 


| G V(f) 

i e V(f) 


£A + j/A + £B + i)B 


and 


(fc£)(A4-B) 


(fc£)A + (fc£)B = fc(£A) + fc(£B) 
fc(£A+£B) = fc£(A + B) 


= [(£ + ,)A]B = (£A + „A)B 
= (£A)B + („A)B = f(A • B) + „(A • B) 


Also, 


(£ + „)(A-B) 
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Thus A + B, A • B G cA and cA is closed with respect to addition and multiplication. 

Addition is both commutative and associative since 

£( A+B ) = iA+iB = £B + (A = £(B + A ) 
and £[(A + B) + C] = i(A + B) + £C = £A + ££ + £C 

= £A+£(B + C) = £[A + (R + 0] 

Let the mapping which carries each element of V(f) into f be denoted by 0; i.e., 

0: £0 = {, teV(f) 

Then 0 £ cA (show this), £(A + 0) = £A + £0 = £ A + f = £A 

and 0 is the additive identity element of cA. 

For each A £ c A, let —A be defined by 

-A: £(-A) = -(£A), £ £ V(f) 

It follows readily that —A £ c A and is the additive inverse of A since 

? = ?A = (£-£)A = £A+ [—(£A)] = £[A + (-A)] = £-0 
We have proved that cA is an abelian additive group. 

Multiplication is clearly associative but, in general, is not commutative (see Problem 55). To com¬ 
plete the proof that cA is a ring, we prove one of the distributive laws 

A • (B + C) = A-B + A-C 
and leave the other for the reader. We have 

£[A-(R + C)] = (fA)(£ + C) = (iA)B + (£A)C 

= £(A-R) + £(A.O = £(A • £ + A • C) 

19. Prove: The set of all non-singular linear transformations of a vector space V(f) 
into itself forms a group under multiplication. 

^Let A,B £ Since A and B are non-singular they map V(f) onto V(f), i.e., V A = V and 
V B - V. Then V ( a.b> = (V A ) B = V B = V and A’B is non-singular. Thus A • B £ <-M and -JW is 
closed under multiplication. The associative law holds in sM since it holds in cA. 

Let the mapping which carries each element of V(f) into itself be denoted by /, i.e., 

I £/ = £, £ £ V(f) 

Evidently, 1 is non-singular, belongs to and since 

£(/-A) = (£/)A = £A = ( { A)7 = £(A • D 
is the identity element in multiplication. 

Now A is a one-to-one mapping of V(f) onto itself; hence it has an inverse A~ i defined by 
A -1 : (£A)A-» = £, £ £ V(f) 

For any £, v £ V{f), A £ <-M, and k £ f, we have £A,,A £ V(f). Then, since 
(£A+,A)A-> = (£ + v)A • A -1 = £ + „=: (£A)A~» + (,A)A~i 
an d [fe(£A)]A-i = [(fc£)A]A-i = &£ = fc[(£A)A-i] 

it follows that A~ l &cA. But, by definition, A~ l is non-singular; hence A -1 £*. Thus each 
element of has a multiplicative inverse and is a multiplicative group. 
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Supplementary Problems 

20. Using Fig. 13-1, Page 143: 

(a) Identify the vectors (1,0) and (0,1); also (a, 0) and (0, b). 

(b) Verify (a, b) = (a, 0) + (0, b) = o(l, 0) + b(0,1). 

21. Using natural definitions for scalar multiplication and vector addition, show that each of the following 
are vector spaces over the indicated field: 

(a) V = R; T = Q (b) V = C ; f = R 

(c) V = {a + by/2 + WS : a.b.e e Q} ; f - Q 

(d) V = all polynomials of degree - 4 over R, including the zero polynomial: T = Q 

(e) V = {c 4 e x + c 2 e 3x : c» e 2 G R) ; f = R 

</) V = {(a 1 ,o 2 ,o 3 ): o, + 2o 2 = 3o 3 }; f = Q 

(g) v = {a+bx: a,b G J/(3)> ; f = //(3) 

22. (a) Why is the set of all polynomials in x of degree > 4 over R not a vector space over R1 

(b) Is the set of all polynomials R[x] a vector space over Q? over C? 

23. Let £,u G V over f and s.tef. Prove: 

(a) When £ ¥* f, then «£ = t£ implies s = t. 

(b) When s ¥* z, then a£ = sji implies £ = <l- 

24. Let {,?»*{£ V over f. Show that £ and i» are linearly dependent if and only if £ = 8ij for some * G f. 

25. (a) Let £,i) € V(R). If £ and a are linearly dependent over R, are they necessarily linearly dependent 

over over C? 

(b) Consider (a) with linearly dependent replaced by linearly independent. 

26. Prove Theorem IV and Theorem VI, Page 146. 

27. For the vector space V = V 4 (R) = {(a, b.c.d) : a, b,e,d& R) over R, which of the following subsets 
are subspaces of VI 

(a) U = ((a, a, a, a) : a 6 8} 

(b) V = {(a, b, a, b): a,bG/} 

(e) U = {(a,2a,b,a+b): a,b G R) 

(d) U = {(a„ a 2 , o 3 , a 4 ): a ( G R, 2a 2 + 3o 3 = 0} 

(e) V = {(Oj, a 2 , a 3 , o 4 ): G R, 2a 2 + 3a s = 5} 

28. Determine whether the following sets of vectors in V 3 (Q) are linearly dependent or independent over Q: 

(a) {(0,0,0), (1,1,1)} (d) {(0,1,-2),(1, -1,1), (1,2,1)} 

(b) {(1,-2,3), (3,-6,9)} ( e ) {(0,2,-4), (1,-2,-1), (1,-4,3)} 

(c) {(1 ,_2,-3),(3,2,1)} (/) {(1,-1,-D, (2,3,1), (-1,4,-2), (3,10,8)} 

Ans. (c), (d) are linearly independent. 

29. Determine whether the following sets of vectors in V 3 (//(5)) are linearly dependent or independent 
over //(5). 

(a) {(1,2,4), (2,4,1)} (c) {(0,1,1), (1,0,1), (3,3,2)} 

(b) {(2,3,4), (3,2,1)} (d) {(4,1,3), (2,3,1), (4,1,0)} 

Ans. (a), (c) are linearly independent. 

30. For the set S = {(1,2,1), (2,3,2), (3,2,3), (1,1,1)} of vectors in V(//(5)) find a maximum linearly inde¬ 
pendent subset T and express each of the remaining vectors as linear combinations of the elements of T. 
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31. Find the dimension of the subset of V 3 (Q) spanned by each set of vectors in Problem 28. 

Ana. ( a),(b) 1; (e),(e) 2; (d),(f) 3 

32. For the vector space C over R, show 
(o) {1,»} is a basis. 

(6) {a + bi, c + di} is a basis if and only if ad — be ^ 0. 

33. In each of the following, find a basis of the vector space which includes the given set of vectors: 

(®) <(1,1,0), (0,1,1)} in V 3 (Q). 

(6) {(2,1, -1, -2), (2,3, -2,1), (4,2, -1,3)} in V 4 (Q ). 

(e) {(2,1,1,0), (1,2,0,1)} in V 4 (//(3)). 

(d) {(t, 0,1,0), (O.t.1,0)} in V 4 (C). 

34. Show that S = ($i, J 2 , { 3 } = {(*, 1 +1, 2), (2 + i, t, 1), (3, 3 + 2i, —1)} is a basis of V 3 (C) and express 
each of the unit vectors of V 3 (C) as a linear combination of the elements of S. 

Ana. ei = [(-39 - 6i)j, + (80 -t){ 2 + (2 - 13f){ s ]/173 

«2 = [(86 - 40i)i, + (-101 + 51t){ 2 + (71 - 29 i)£ 3 ]/346 
*3 = [(104 + 16i)f, + (75 - 55i){ 2 + (-63 - 23»)£ 3 ]/346 

35. Prove: If fej(j + fe 2 £ 2 + fc 3 | 3 = f, where k { k 2 ¥- z, then <(j,( 3 } and <£ 2 , £3} generate the same space. 

36. Prove Theorem IX, Page 147. 

37. Prove: If {f lr | 2 , {3} is a basis of V 3 (Q), so also is <£, + £ 2 , | 2 + ( 3 , { 3 + {,}. Is this true in V 3 (7/(2))? 
In V 3 (//(3))? 

38. Prove Theorem X, Page 147. Hint. Assume A and B, containing respectively m and n elements, are 
bases of V. First, associate S with A and T with B, then S with B and T with A, and apply Theorem 
VIII, Page 147. 

39. Prove: If V is a vector space of dimension n ^ 0, any linearly independent set of n vectors of V is 

40. Let ti,&...,( m 6V and S = {£1,£ 2 .f m } span a subspace UcV. Show that the minimum 

number of vectors of V necessary to span U is the maximum number of linearly independent vectors 
in S. 

41. Let <£i,£ 2 , ...,£„} be a basis of V. Show that every vector £ e V has a unique representation as a 
linear combination of the basis vectors. 

Hint. Suppose £ = Scrfi = 2 d^; then 2 C& ~ 2 d ( (, = f. 

42. Prove: If V and W are subspaces of a vector space V, so also are U n W and U + W. 

43. Let the subspaces U and W of V 4 (Q) be spanned by 

A = {(2,-1,1,0), (1,0,2,1)} and B = {(0,0,1,0), (0,1,0,1), (4,-1,5,2)} 
respectively. Verify Theorem XIII, Page 148. Find a basis of U +W which includes the vectors 
of A; also a basis which includes the vectors of B. 

44. Show that P = {(a, b, — b, a): a,b £ R} with addition defined by 

(a, 6, —6, a) + (c, d, —d, c) = (a + e, b + d, —(6 + d), a + c) 
and scalar multiplication defined by k(a, b, — b, a) - (ka, kb, -kb, ka), for all (a, b, —b, a), (c, d, —d, c)£P 
and fee R, is a vector space of dimension two. 

45. Let the prime p be a prime element of G of Problem 8, Chapter 10. Denote by f the field G/(p) and 

by f the prime field I/(p) of f. The field f, considered as a vector space over f, has as basis <l,i}; 
hence, f = (o, • 1 + o 2 • t: a u a 2 G f'}. (a) Show that f has at most p 2 elements. (6) Show that 

f has at least p 2 elements, (that is, o, • 1 + Ojj • i - b 4 • 1 + 6 2 • i if and only if a, - = « 2 - & 2 = 0) 

and hence exactly p 2 elements. 
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46. Generalize Problem 45 to a finite field f of characteristic p, a prime, over its prime field having n 
elements as basis. 

47. For £, 7), /i G V n (R) and k&R, prove: 

(a) £•„ = „•£, (5) (i + nW = £•<* + n’**, («) (**’’») = *(€-•»)- 

48. Obtain from the Schwarz inequality — 1 — (£• v)/(|{| • M) — 1 to show that cos 6 = *.. . deter¬ 

mines one and only one angle between 0° and 180°. 

49 Let length be defined as in V n (R). Show that (1,1) 6 V 2 (Q) is without length while (l,t) € V 2 (C) 
is of length 0. Can you suggest a definition of £ • v so that each non-zero vector of V 2 (C) will have 
length different from 0? 

50. Let (,v £ V n (R) such that |{| = |„|. Show that £-„ and £ + >, are orthogonal. What is the geo¬ 
metric interpretation? 

51. For the vector space V of all polynomials in a; over a field f, verify that each of the following mappings 
of V into itself is a linear transformation of V. 

(a) a(x) -> a(x) (c) «(*) -» «(-*) 

(ft) a(x) ■* - a(x) (d) «(*) - «(0) 

52. Show that the mapping T : (a, b) -* (e + 1, b + 1) of V 2 (R) into itself is not a linear transformation. 
Hint. Compare (e t + t 2 )T and («jT + « 2 T). 

53. For each of the linear transformations A, examine the image of an arbitrary vector £ to determine 
the rank of A and, if A is singular, to determine a non-zero vector whose image is 0. 


(a) A 


«i - (2,1), 

*2 -* (1,2) 


(b) A 


«, - (3,-4), 

« 2 - (-3,4) 


(e) A 


«, - (1,1,2), 

e 2 - (2,1,3), 

<3 - (1,0,-2) 

(d) A 


«, -> a,-i,i). 

* 2 - (-3,3,-3), 

e 3 - (2,3,4) 

<•) A 


«1 - (0, 1, —1), 

c 2 - (-1,1,1), 

«3 - (1,0,-2) 

if) A 


«, - (1,0,3), 

e 2 - (0,1,1), 

e 3 - (2,2,8) 

Ans. (a) 

non-singular; (6) singular, (1,1); 

(c) non-singular; (d) singular, (3,1,0); (e) singular, (—1,1,1); 


(/) singular, (2,2, —1) 


54. For all A,B e cA and k,l £ f, prove 

(а) kAecA 

(б) k(A + B) = kA + kB; (k + l)A = kA + IA 

(c) k(A’B) = ( kA)B = A(kB ); (k • l)A = k(lA) 

(d) 0 • A = fcO = 0; uA = A 

with 0 defined on Page 159. Together with Problem 18, this completes the proof of Theorem XX, 
Page 152. 

55. Compute B'A for the linear transformations of Example 13, Page 152, to show that, in general, 
A-B^B’A. 


fn 

and B : ■< e 2 -* ( m > n < P) 

s ) 

tj -» (o + j, b + fc, c + l) 
e 2 -» (d + TO, e + TO, / + p) 

[* 3 -* (g + q, h + r, i + s) 


56. For the linear transformations on V 3 (R) 
-» (a, 6, c) 
+ (d,e,f) 
-» (g,k,i) 
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and, for any k&R, 


( aj + bm + eq,ak + bn 4- cr, al + bp + es) 
(dj + em + fq, dk + en + fr, dl + ep + fs) 
(gj + hm + iq, gk + hn + ir, gl + hp + is) 

( «i -* (fco.fc6.fcc) 

£2 -* (fcd.fce.fc/) 

«3 -* ( kg.kh.ki) 


57. Compute the inverse A~i of A : 


(2,3) 


of V 2 (R). 


Hint. Take A ~ 


\e 2 -> (r, 8) 


and consider ({A)A 1 = { where J = (o, 6). 


58. For the mapping , 

T ■ J'» = ( 1 -°) - (1,1,1) 

1 ' \« 2 = (O.D - (0,1,2) 

verify: 

(o) T x is a linear transformation of V into W. 
(6) The image of £ = (2,1) € V is (2,3,4) € W. 

(c) The vector (1, 2,2) eW is not an image. 

(d) V Tl has dimension 2. 


of V=V 2 (R) into W = V S (R) 


59. For the mapping 

T*: 


verify: 


"«, = (1,0,0) 
• e 2 = (0,1,0) 
<3 = (0,0,1) 


(1,0,1,1) 
(0,1,1,1) 
( 1 ,- 1 , 0 , 0 ) 


of V = V 3 (R) into W = V 4 (R) 


(а) T 2 is a linear transformation of V into W. 

(б) The image of £ = (1, —1, -1) e V is (0,0,0,0) e W. 
(c) Vj 2 has dimension 2 and r T ^ = 2. 


60. For T i of Problem 58 and T 2 of Problem 69, verify 


r, • r 2 : 

What is the rank of T x - T 2 1 


U = d,o) 
\«2 = ( 0 . 1 ) 


( 2 , 0 , 2 , 2 ) 
( 2 ,- 1 , 1 , 1 ) 
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Consider again the linear transformations on V 3 (R) 

f tl -»(a,b,c) f-* O', fc. 1) 

.4: -Land B : \t 2 -*(m,n,p) (1) 

[ H -+(g,h,i) U -*(Q>r,s) 

for which (see Problem 56, Chapter 13, Page 162) 

( tj -* (a + j,b + k,c +1) 
t 2 -*■ ( d + m,e + n,f + p ) 
c 3 •* (g + q,h + r,i + s) 

{ t, -» (aj + bm + cq, ak + bn + er, al + bp + cs) 

< 2 -* (dj + em + fq, dk + en + fr, dl + ep + fs) 

< 3 -*• (93 + hm + iq, gk + hn + ir, gl + hp + is) 


-» ( ka,kb,kc) 

h -* (kd,ke,kf) , k G R 
£ 3 -> (kg, kh, ki) 

As a step toward simplifying matters, let the present notation for linear transforma¬ 
tions A and B be replaced by the arrays 


A = 

a b c 1 
d e f 

and B = 

\ 3 k l~j 

\m n p 


g h i 


L* r s j 


effected by enclosing the image vectors in brackets and then removing the excess paren¬ 
theses and commas. Our problem is to translate the operations on linear transformations 
into corresponding operations on their arrays. We have: 

The sum A +B of the arrays A and B is the array whose elements are the sums of 
the corresponding elements of A and B. 

The scalar product kA of k, any scalar, and A is the array whose elements are k 
times the corresponding elements of A. 

In forming the product A • B think of A as consisting of the vectors Pl , p 2 , p s (the row 
vectors of A) whose components are the elements of the rows of A and think of B as con¬ 
sisting of the vectors y v y 2 , y 3 (the column vectors of B) whose components are the elements 
of the columns of B. Then 
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SQUARE MATRICES 

The arrays of the preceding section, on which addition, multiplication, and scalar mul¬ 
tiplication have been defined, are called square matrices ; more precisely, they are square 
matrices of order 3 since they have 3 2 elements. (In Example 1(a), the square matrices are 
of order 2 ). 

In order to permit the writing in full of square matrices of higher orders, we now intro¬ 
duce a more uniform notation. Hereafter, the elements of an arbitrary square matrix will 
be denoted by a single letter with varying subscripts; for example,* 


an ai2 ai 3 

021 022 023 

and B = 

6ll 612 &13 614 
621 622 623 624 

031 032 O33 


&S1 632 633 634 
641 642 643 644 


Any element, say, b 2 * is to be thought of as 62,4 although unless necessary (e.g., b 12 i which 
could be bi 2 ,i or 61.21) we shall not print the comma. One advantage of this notation is that 
each element discloses its exact position in the matrix. For example, the element 624 stands 
in the second row and fourth column, the element 6 32 stands in the third row and second 
column, and so on. Another advantage is that we may indicate the matrices A and B above 
by writing 

A = [av], (i = 1 , 2 ,3; j = 1 , 2 , 3 ) 

B = [6«], (* = 1,2,3,4; j = 1,2,3,4) 


and 
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Then with A defined above and C = 

M, {i,j 

= 1,2,3), 

the product 


2 0,li Cji 

2 

2 Oll c i3 

A‘C = 

2 OLijCji 

2 

2 ®21 C )3 


2 GsjCji 

2 ®3J Cj2 

2 a 3jCjS 


where in each case the summation extends over all values of j; for example, 


2 fflsycj 3 — <i 2 iCi 3 + U 22 C 23 + < 123633 , etc. 

Two square matrices L and M will be called equal, L — M, if and only if one is the 
duplicate of the other; i.e., if and only if L and M are the same linear transformation. Thus 
two equal matrices necessarily have the same order. 

In any square matrix the diagonal drawn from the upper left corner to the lower right 
corner is called the principal diagonal of the matrix. The elements standing in a principal 
diagonal are those and only those (an, < 122 , a 3 3 of A, for example) whose row and column 
indices are equal. 

By definition, there is a one-to-one correspondence between the set of all linear transfor¬ 
mations of a vector space over y of dimension « into itself and the set of all square matrices 
over y of order n (set of all n-square matrices over f). Moreover, we have defined addition 
and multiplication on these matrices so that this correspondence is an isomorphism. Hence, 
by Theorems XVIII and XIX of Chapter 13, Page 152, we have 

Theorem I. With respect to addition and multiplication, the set of all n-square matrices 
over f is a ring % with unity. 


As a consequence: 

Addition is both associative and commutative on %. 

Multiplication is associative but, in general, not commutative on %. 

Multiplication is both left and right distributive with respect to addition. 

There exists a matrix 0„ or 0, the zero element of %, each of whose elements is the zero 


element of f. For example, 0 2 = 
R of orders 2 and 3 respectively. 



1 

O 

O 

O 

[° 0 ( and 0s = 

0 0 0 

|o oj 

0 0 0 


are zero matrices over 


There exists a matrix /„ or I, the unity of %, having the unity of f as all elements along 
the principal diagonal and the zero element of y elsewhere. For example, 1 2 = J"_ 1 


10 0 
0 10 
0 0 1 


e identity matrices over R of orders 2 and 3 respectively. 


For each A = [a«] G %, there exists an additive inverse -A = (-l)[a«] = [-a«] such 
that A + (—A) = 0. 

Throughout the remainder of this book we shall use 0 and 1 respectively to denote the 
zero element and unity of any field (see Page 118). Whenever z and u, originally 
reserved to denote these elements, appear they will have quite different connotations. 
Also, 0 and I as defined above over R will be used as the zero and identity matrices over 
any field f. 


By Theorem XX, Chapter 13, Page 152, we have 
Theorem II. The set of all n-square matrices over f is itself a vector space. 
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A set of basis elements for this vector space consists of the n 2 matrices 
Ea, (*,; = 1,2,3, ...,n) 

of order n having 1 as element in the (i,j) position and 0’s elsewhere. For example, 


{Eh, E 12, E21, E22} — 


°, ° 1 , ° »i [0 0] 

0 oj [0 oj- [1 oj- [0 ij, 

is a basis of the vector space of all 2-square matrices over f ; and for anv A = [ b \ 
we have A = aE n 4 - bE i2 + cE n + dE 22 . “ J 


TOTAL MATRIX ALGEBRA 

The set of all n-square matrices over f with the operations of addition, multiplication 
and scalar multiplication by elements of f is called the total matrix algebra M (T). Now 
just as there are subgroups of groups, subrings of rings, ...., so there are subalgebras of 

r a b 

M n (r). For example, the set of all matrices (M of the form 


2c 

26 2c 


where 


a ’ b ’ C G \ S a subalgebra of at,(Q). All that is needed to prove this is to show that addi- 
!S m . U an f? lar multiplication by elements of Q on elements of a1 invariably 

yield elements of at. Addition and scalar multiplication give no trouble, and so at is I 
of the vector space a /,(Q). For multiplication, note that a basis of at is the set 


I, X = 


0 10 
0 0 1 
2 0 0 


0 0 1 
2 0 0 
0 2 0 


We leave for the reader to show that for A, B G (M, 


A'B — (al + bX + cY)(xI + yX -f zY) 

= (ax + 2bz + 2cy)I + (ay + bx + 2cz)X + (az + by + cx)Y G at; 
also, that multiplication is commutative on Of. 


A MATRIX OF ORDER m x n 

By a matrix over f we shall mean any rectangular array of elements of f ; for example, 


A = 

On Ol2 Ol3 

O2I 022 O23 

, B = 

611 612 b\z 6h 
621 622 623 624 

, C = 

Cn C12 

C21 C22 


031 032 Ossj 


631 632 633 634 


C31 C32 

C41 C42_ 


A = (*» j = 1.2,3) B = [6y], (t = 1,2,3; j = 1,2,3,4) 


C = M. (* = 1 , 2 ,3,4; j = 1 , 2 ) 


Any such matrix of m rows and » columns will be called a matrix of order 
For fixed m and n, consider the set of all matrices over f of order mxn, 
and scalar multiplication defined exactly as for square matrices, we have 


1 x n. 

With addition 


Theorem IF. The set of all matrices over f of order mxnisa vector space over f. 

Chanter 0 !^ 0 “"1 ° nset unless m = n - However, we may, as Problem 60, 
Chapter 13, Page 163, suggests, define the product of certain rectangular arrays For 
example using the matrices A,R,C above, we can form A-B but not B-A; B-CbatZ 
B and neither A -C nor C- A. The reason is clear; in order to form L-M the number 
of columns of L must equal the number of rows of M. For B and C above, we have 
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Pi 


Pi • Yi Pi * Ya 


2 bijCji 2 bijCjz 

B-C = 

[:J 

• [yi yj = 

P 2 *Yi P 2 *Y 2 

Pa’Yi Ps * Y 2 


2 bijCji 2 buCn 
2 &3jCn 2 bijCji 


Thus the product of a matrix of order m x « and a matrix of order n x p, both over the 
same field f, is a matrix of order mxp. gee p ro blems 2-3. 


SOLUTIONS OF A SYSTEM OF LINEAR EQUATIONS 

The study of matrices, thus far, has been dominated by our previous study of linear 
transformations of vector spaces. We might, however, have begun our study of matrices 
by noting the one-to-one correspondence between all systems of homogeneous linear equa¬ 
tions over R and the set of arrays of their coefficients; for example 

(i) 2x + 3y + z = O'! [~2 3 1~| 

(ii) x - y + 4z = 01 and 1-14 ( 3 ) 

(iii) 4x 4- lly — 5z = OJ _4 11 — 

What we plan to do here is to show that a matrix, considered as the coefficient matrix of 
a system of homogeneous equations, can be used (in place of the actual equations) to obtain 
solutions of the system. In each of the steps below, we state our “moves”, the result of 
these “moves” in terms of the equations, and finally in terms of the matrix. 

The given system (3) has the trivial solution x = y = z = 0; it will have non-trivial 
solutions if and only if one of the equations is a linear combination of the other two, i.e. 
if and only if the row vectors of the coefficient matrix are linearly dependent. The procedure 
for finding the non-trivial solutions, if any, is well known to the reader. The set of “moves” 
is never unique; we shall proceed as follows: multiply the second equation by 2 and subtract 
from the first equation, then multiply the second equation by 4 and subtract from the third 
equation to obtain 

(i) — 2(ii) Ox + 5y — 7z = 0 R) 5 -7~| 

(ii) x- y + 4z = 0 1-14 (4) 

(iii) — 4(ii) 0x + 15y-21z = 0 [_0 15 -21_ 

In (-4), multiply the first equation by 3 and subtract from the third equation to obtain 

(i) - 2(ii) Ox + 5y - 7z = 0 TO 5 -7 

(ii) x - y + 42 = 0 1-14 ( 5 ) 

(iii) + 2(ii) - 3(i) Oz + Oy + Oz = 0 |_0 0 0_ 

Finally, in (5), multiply the first equation by 1/5 and, after entering it as the first 
equation in (6), add it to the second equation. We have 

£[(i) - 2(ii)] 0 x+ y- 7z/5 = 0 To 1 -7/5l 

i[3(ii) + (i)] x + 0y + 13z/5 = 0 10 13/5 ( 6 ) 

(iii) + 2(ii) — 3(i) Ox + Oy + Oz = 0 [0 0 0 

Now if we take for z any arbitrary r e R, we have as solutions of the system: 
x = — 13r/5, y = lr/5, z = r. 
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We summarize: from the given system of equations (3), we extracted the matrix 

0 1 —7/5~j 

1 0 13/5 ; 


2 3 1 

1 -1 4 

4 11 -5 


by operating on the rows of A we obtained the matrix B = 


0 0 0 


considering B as a coefficient matrix in the same unknowns, we read out the solutions of 
the original system. We give now three problems from vector spaces whose solutions follow 
easily. 


Example 2: Is {l = (2,1,4), J 2 = (3, -1,11), * 3 = (1, 4 , -5) a basis of V 3 (/?)? 

We set x£j + 3/| 2 + *{3 = (2a; + 3y + z, x — y + 4z, 4* + lly - 5z) = 0 = 

(0,0,0) and obtain the system of equations (3). Using the solution x = -13/5 
y ~ 7 ^ 5, 2 ~ b we h ~ (13/5)ti ~ (7/5)£ 2 - Thus the given set is not a basis. 
This, of course, is implied by the matrix (5) having a row of zeros. 


Example 3: 


Is the set Pl - (2,3,1), P2 _ (1,-1,4), Ps = (4,11,-5) a basis of V a (R)l 

The given vectors are the row vectors of (S). From the record of moves in (5), 
" Ps + 2p 2 — 3 Pl = 0. Thus the set is not a basis. 


we extract (iii) + 2(ii) - 3(i) = 0 < 


Note - The Problems solved in Examples 2 and 3 are of the same type and the com¬ 
putations are identical; the initial procedures, however, are quite different. In Example 2, 
the given vectors constitute the columns of the matrix and the operations on the matrix 
involve linear combinations of the corresponding components of these vectors. In Example 3 
the given vectors constitute the rows of the matrix and the operations on this matrix 
involve linear combinations of the vectors themselves. We shall continue to use the 
notation of Chapter 13 in which a vector of F„(f) is written as a row of elements and 
thus hereafter use the procedure of Example 3 . 


Example 4: Show that the linear transformation 


of V = V & (R) 


We write 


f«i -» (2,3,1) = P1 
T - -U - (1,-1,4) = P2 
l *3 - (4,11, —5) = P3 


is singular and find a vector of V whose image is 0. 


T = 


2 3 1 

1-1 4 

4 11 -5 


Pi 

P2 


By Example 3, Pi) + 2 Pz - 3 Pl = 0; 


thus the image of any vector of V is a linear combination of the vectors Pl and P2 . 
Hence V T lias dimension 2 and T is singular. This, of course, is implied by the 
matrix (5) having a single row of zeros. 

Since 3 Pl — 2 P2 — p 3 = 0, the image of 1 = (3,-2,-1) is 0, i.e. 

2 3 1 

(3,-2,-1)* 1-1 4 =0 

4 11 -5 


Note. The vector (3,— 2 ,- 1 ) may be considered as a 1 
product above is a valid one. 


3-matrix; hence the indicated 
See Problem 4. 


ELEMENTARY TRANSFORMATIONS ON A MATRIX 

In solving a system of homogeneous linear equations with coefficients in f certain 
operations may be performed on the elements (equations) of the system without changing 
its solution or solutions: 

Any two equations may be interchanged. 

Any equation may be multiplied by any scalar 0 of f\ 
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Any equation may be multiplied by any scalar and added to any other equation. 

The operations, called elementary row transformations, thereby induced on the coeffi¬ 
cient matrix of the system are: 

The interchange of the ith and ith rows, denoted by Ha. 

The multiplication of every element of the ith row by a non-zero scalar k, 
denoted by Hi(k). 

The addition to the elements of the ith row of k (a scalar) times the correspond¬ 
ing elements of the ith row, denoted by Hu(k). 

Later we shall find useful the elementary column transformations on a matrix which 
we now list: 

The interchange of the ith and ith columns, denoted by K u . 

The multiplication of every element of the ith column by a non-zero scalar k, 
denoted by Ki(k). 

The addition to the elements of the ith column of k (a scalar) times the cor¬ 
responding elements of the ith column, denoted by Ka(k). 

Two matrices A and B will be called row ( column) equivalent if B can be obtained 
from A by a sequence of elementary row (column) transformations. Similarly, two 
matrices A and B will be called equivalent if B can be obtained from A by a sequence of 
row and column transformations. When B is row equivalent, column equivalent or equiva¬ 
lent to A, we shall write B~ A. We leave for the reader to show that - is an equivalence 
relation. 

Example 5: (a) Show that the set {(1,2,1,2), (2,4,3,4), (1,3,2,3), (0,3,1,3)} is not a basis of 

V 4 (Q). (b) If T is the linear transformation having the vectors of (a) in order as 

images of ej,e 2 ,« 3 ,« 4 , what is the rank of T1 (c) Find a basis of V 4 (Q) containing 
a maximum linearly independent subset of the vectors of (a). 


(a) Using in turn H n (- 2), H ia (- 2), tf 43 (-3); H 42 (2), we have 


12 12 
2 4 3 4 


The set is not a basis. 
(6) Using H 1Z (1), H 32 (-l) 


12 12 
0 0 10 
0 111 
0 3 13 


i the final matrix obtained in (a), we have 


10-10 1000 

0010 _ 0010 
0111 0101 

0000 0000 


Now B has the maximum number of zeros possible in any matrix row equivalent 
to A (verify this). Since B has 3 non-zero row vectors, V r is of dimension 3 and 
r T - 3. 


(c) A check of the moves will show that a multiple of the fourth row was never 
added to any of the other three. Thus the first three vectors of the given set 
are linear combinations of the non-zero row vectors of B. The first three 
vectors of the given set together with any vector not a linear combination of 
the non-zero row vectors of B, for example e 2 or e 4 , is a basis of V 4 (<?). 


Consider the rows of a given matrix A as a set S of row vectors of V„(f) and interpret 
the elementary row transformations on A in terms of the vectors of S as: 
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The interchange of any two vectors in S. 

The replacement of any vector £ G S by a non-zero scalar multiple a£. 

The replacement of any vector £ G S by a linear combination £ + br/ of £ and 
any other vector y G S. 

The foregoing examples illustrate 

Theorem III. The above operations on a set S of vectors of V n (f') neither increase nor 
decrease the number of linearly independent vectors in S. 

See Problems 5-7. 


UPPER TRIANGULAR, LOWER TRIANGULAR AND DIAGONAL MATRICES 

A square matrix A = [a«] is called upper triangular if ay = 0 whenever i > j, and 
is called lower triangular if a« = 0 whenever i < j. A square matrix which is both 


upper and lower triangular is called a diagonal matrix. For example. 


12 3 
0 0 4 
0 0 2 



10 0 


10 0 


-2 0 0 

triangular, 

2 3 0 

3 4 5 

is lower triangular, while 

0 0 0 

0 0 2 

and 

0 3 0 

0 0 1 


are diagonal. 


By means of elementary transformations, any square matrix can be reduced to upper 
triangular, lower triangular, and diagonal form. 

Example 6: 

I 1 2 3 I 

Reduce A = | 4 5 6 | over Q to upper triangular, lower triangular, and diagonal form. 

5 7 8 


(a) Using i? 21 (—4), ff sl <-5); ff 32 (-l), we obtain 


A = 


12 3 

4 5 6 

5 7 8 


i -3 -6 
' -3 -7 


0 -3 -6 
0 0-1 


which is upper triangular. 


(6) Using ff 12 (-2/5), H^-BH); ff 12 (-21/10), H^- 1/28) 

-3/2 0 

1/4 -1/4 


12 3 

4 5 6 

5 7 8 


-3/5 0 3/5 
3/7 0 2/7 


5 7 8 


which is lower triangular. 


(c) Using H 21 (-4), H 31 <—5), tf 32 <-1); H 12 (2/3); H la (- 1), H n (-6) 


1 0-1 
0 -3 -6 
0 0-1 


which is diagonal. 

See also Problem 8. 


A CANONICAL FORM 
In Problem 9, we prove 

Theorem IV. Any non-zero matrix A over f can be reduced by a sequence of elementary 
row transformations to a row canonical matrix (echelon matrix) C having 
the properties: 

(i) Each of the first r rows of C has at least one non-zero element; the 
remaining rows, if any, consist entirely of zero elements. 
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(ii) In the ith row (i = 1,2, ..r) of C, its first non-zero element is 1. 
Let the column in which this element stands be numbered ju 

(iii) The only non-zero element in the column numbered j it (i = 1,2, . . ,,r) 
is the element 1 of the ith row. 

(iv) ji<h< ■■■ < h. 


(a) The matrix B of Problem 6, Page 187, is a row canonical matrix. The first 
non-zero element of the first row is 1 and stands in the first column, the first 
non-zero element of the second row is 1 and stands in the second column, the 
first non-zero element of the third row is 1 and stands in the fifth column. 
Thus, 3 i = 1, j 2 — 2, j 3 = 5 and j x < j 2 < h is satisfied. 


(ft) The matrix B of Problem 7, Page 187, fails to meet condition (iv) and is not a 


row canonical matrix. It may, however, be reduced to 


0 

0 

0 

_0 



a row canonical matrix, by the elementary row transformations H l2 , H l3 . 


In Problem 5, Page 186, the matrix B is a row canonical matrix; it is also the identity 
matrix of order 3. The linear transformation A is non-singular; we shall also call the 
matrix A non-singular. Thus, 

An n-square matrix is non-singular if and only if it is row equivalent to the 
identity matrix /„. 


Any n-square matrix which is not non-singular is called singular. The terms singular and 
non-singular are never used when the matrix is of order m x n with m¥-n. 

The rank of a linear transformation A is the number of linearly independent vectors 
in the set of image vectors. We shall call the rank of the linear transformation A the 
row rank of the matrix A. Thus, 

The row rank of an mxn matrix is the number of non-zero rows in its row 
equivalent canonical matrix. 


It is not necessary, of course, to reduce a matrix to row canonical form to determine its 
rank. For example, the rank of the matrix A in Problem 7 can be obtained as readily 
from B as from the row canonical matrix C of Example 7(6). 


ELEMENTARY COLUMN TRANSFORMATIONS 

Beginning with a matrix A and using only elementary column transformations, we 
may obtain matrices called column equivalent to A. Among these is a column canonical 
matrix D whose properties are precisely those obtained by interchanging “row” and 
“column” in the list of properties of the row canonical matrix C. We define the column 
rank of A to be the number of columns of D having at least one non-zero element. Our 
only interest in all of this is 

Theorem Y. The row rank and the column rank of any matrix A are equal. 

For a proof, see Problem 10. 

As a consequence, we define 

The rank of a matrix is its row (column) rank. 

Let a matrix A over f of order mxn and rank r be reduced to its row canonical form C. 
Then using the element 1 which appears in each of the first r rows of C and appropriate 
transformations of the type K l} (k), C may be reduced to a matrix whose only non-zero 
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elements are these l’s. Finally, using transformations of the type Ka, these l’s can be 
made to occupy the diagonal positions in the first r rows and first r columns. The resulting 
matrix, denoted by N, is called the normal form of A. 


Example 8: 

(a) In Problem 4 we have 


A 


1220 104 -2 

2531 _ 0 1-1 1 

3842 0000 

2713 0000 


Using K 31 (— 4), K 41 ( 2); K 32 (\), K i2 (— 1) on C, we obtain 


A 



1000 1000 
0 1-1 1 _ 0100 
0000 0000 

0000 0000 


c 


the normal form. 


(6) The matrix B is the normal form of A in Problem 5. 


( c ) For the matrix of Problem 6, we obtain using on B the elementary column transformations 
«- 31 (-4), *32(1), *42<-2); * 35 


A 


0 0 
2 0 
0 1 


1 0 0 0 0 
0 10 0 0 
0 0 10 0 


lh 0] 


Note. From these examples it might be thought that, in reducing A to its normal 
form, one first works with row transformations and then exclusively with column trans¬ 
formations. This order is not necessary. g ee p ro blem 11. 


ELEMENTARY MATRICES 


The matrix which results when an elementary row (column) transformation is applied 
to the identity matrix /„ is called an elementary row ( column) matrix. Any elementary 
row (column) matrix will be denoted by the same symbol used to denote the elementary 
transformation which produces the matrix. 


Example 9: 

When 1 = 


10 0 
0 10 
0 0 1 





0 0 1 


10 0 


10 0 

h 13 = 

0 1 0 
_! 0 oj 

= *, 3, H 2 (k) = 

0*0 

0 0 1 

= K 2 (k), H 23 (k) = 

0 1 k 

0 0 1 


By Theorem III, we have 

Theorem YI. Every elementary matrix is non-singular, 
and 

Theorem VII. The product of two or more elementary matrices is non-singular. 


There follows easily 

Theorem VIII. To perform an elementary row (column) transformation H ( K) on a 
matrix A of order mx n, form the product H • A (A • K) where H (K) 
is the matrix obtained by performing the transformation H ( K ) on I. 
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The matrices H and K of Theorem VIII will carry no indicator of their orders. If 
A is of order mxn, then a product such as Hu’ A • K 2 a(k) must imply that Hu is of order m 
while K 2 a(k) is of order n since otherwise the indicated product is meaningless. 


Example 10: Given A = 


12 3 4 
5 6 7 8 
2 4 6 8 


over Q, calculate 


(a) H la -A 


0 0 1 1 2 3 4 
0 10 • 5 6 7 8 
1 0 0 2 4 6 8 


2 4 6 8 
5 6 7 8 
12 3 4 


(b) tf,(-3)-A 


-3 0 0 1 2 3 4 

0 10 • 5 6 7 8 

0 0 1 2 4 6 8 


-3 -6 -9 -12 
5 6 7 8 

2 4 6 8 


(c) A • *4,1-4) 


12 3 4 
5 6 7 8 
2 4 6 8 


10 0-4 
0 10 0 
0 0 10 
0 0 0 1 


12 3 0 
5 6 7 -12 
2 4 6 0 


Suppose now that Hi,H 2 , .. .,H, and Ki,K 2 , .. .,K t are sequences of elementary trans¬ 
formations which when performed, in the order of their subscripts, on a matrix A reduce 
it to B, i.e., 

H s ’ ... • H 2 ’ Hi • A • Ki • K.2’ ... ‘ K t = B 


Then, defining S = H s - 


’H*’Hi and T = K X ’K 2 ’. 
S’A’T = B 


• K t , we have 


Now A and B are equivalent matrices. The proof of the converse 

Theorem IX. If A and B are equivalent matrices there exist non-singular matrices S 
and T such that S’A’T = B. 
will be given in the next section. 

As a consequence of Theorem IX, we have 

Theorem IX'. For any matrix A there exist non-singular matrices S and T such that 
S’A'T = N, the normal form of A. 


Example 11: Find non-singular matrices 5 and T c 
2 -1 


• T = N, the normal form of A. 


Using ff 21 (—3), H a i(— 4), X 21 (—2), *3,(1), we find A = 

fl 0 0 


using H 23 (—1), we obtain A ~ 
Thus, 


10 0 
0 10 
0 0 1 


0 12 
0 13 


10 0 
0 2 5 
0 13 


and finally H 32 (— 1), A 32 (—2) yield the normal form 


H 32 (- 1) • tf 23 (-1) • tf 31 (—4) • ff 21 (—3) • A ■ K 2l (- 2) • * 3l (l) • * 32 (—2) 
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10 0 12-1 
1 1-1*3 8 2 

-5 -1 2 4 9 -1 


1-2 5 10 0 

0 1-2 = 0 10 

0 0 1 0 0 1 


An alternate procedure is as follows: We begin with the array 


A I 3 


10 0 
0 10 

0 0 1 

12-1100 
3 8 2 0 1 0 

= 4 9 -1 0 0 1 


and proceed to reduce A to I. In doing so, each row transformation is performed on the rows of 
six elements and each column transformation is performed on the columns of six elements. Using 


H ai (-3), *„(—2), *„<!); H ia (- 

-i); 

H 32 (-l); K a2 (- 2) 

* 

'e obtain 


1 

0 0 


1 

0 

0 


1 - 

-2 

1 



0 

1 0 


0 

1 

0 


0 

1 

0 



0 

0 1 


0 

0 

1 


0 

0 

1 



1 

2 -1 

1 0 

0 1 

2 

-1 

l 

0 0 1 

0 

0 

1 

0 0 

3 

8 2 

0 1 

0 0 

2 

5 

-3 

10 0 

2 

5 

-3 

1 0 

4 

9 -1 

0 0 

1 - 0 

1 

3 

-4 

0 1 -> 0 

1 

3 

-4 

0 1 


1 - 

-2 1 



1 - 

-2 

1 


1 - 

-2 

5 


0 

1 0 



0 

1 

0 


0 

1 - 

-2 


0 

0 1 



0 

0 

1 


0 

0 

1 


1 

0 0 

10 0 


1 

0 

0 10 0 


1 

0 

0 10 0 


0 

1 2 

1 1 -1 


0 

1 

2 11-1 


0 

1 

011-1 T 

-• 

* 0 

1 3 - 

-4 0 1 

- 

0 

0 

1-5-1 2 -* 

0 

0 

1-5-1 2 = / S 

and 

S-A 

• T = I, 

as before. 








See also Problem 12. 


INVERSES OF ELEMENTARY MATRICES 

For each of the elementary transformations there is an inverse transformation, that is, 
a transformation which undoes whatever the elementary transformation does. In terms 
of elementary transformations or of elementary matrices, we find readily 
tfy 1 = Hu = Ka 

Hr 1 (ft) = Hi{\/k) Kr\k) = Ki{l/k) 

Hv'(k) = Ha(—k) Ky l {k) = K„(-k) 

Thus, 

Theorem X. The inverse of an elementary row (column) transformation is an elemen¬ 
tary row (column) transformation of the same order. 

and 

Theorem XI. The inverse of an elementary row (column) matrix is non-singular. 

In Problem 13, we prove 

Theorem XII. The inverse of the product of two matrices A and B, each of which has an 
inverse, is the product of the inverses in reverse order, that is, 

(A*S)-> = 
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Theorem XII may be extended readily to the inverse of the product of any number of 
matrices. In particular, we have: 

if S = H s - ... -Hz-H i then S" 1 = H r 1 • H 2 l ■ ... • H7 1 , 
if T = Kt ■ Kz • ... • K t then T~ l = K^ 1 • ... ■ K~ x • Xf 1 . 

Suppose A of order mxn. By Theorem IX', there exist non-singular matrices S of 
order m and T of order n such that S' A'T = N, the normal form of A. Then 
A = S~ l (S-A'T)T- 1 = S~ l 'N'T~ l 

In particular, we have 

Theorem XIII. If A is non-singular and if S'A-T - I, then 

A = S~ l 'T~ l 

that is, every non-singular matrix of order n can be expressed as a 
product of elementary matrices of the same order. 


Example 12: In Example 11, we have 


S 

Then 

S- 1 


= tf 32 (-l) • H b (- 1) • H 3 i(- 4) * #2i(-3) and 
= H~ l (-3) • H~ j 1 (-4) • H^ 1 (-D • H~ l (-1) 


T = K 2l (- 2) • K 31 (l) • K„(-2) 

= H 2l (Z) ' H 31 ( 4) • tf 23 ( 1) • H 32 ( 1) 


10 0 
3 10 
0 0 1 


10 0 
0 10 
4 0 1 


10 0 
0 11 
0 0 1 


10 0 
0 10 
Oil 


10 0 

3 2 1 

4 11 


T ~ 1 §§ K 32 (2) ' K n (-1) ' K 2l (2) = 


10 0 
0 12 
0 0 1 


and A = S ~ 1 • T~ l - 


10 0 

3 2 1 

4 11 


1 2-1 

0 1 2 

0 0 1 


Suppose A and B over f of order m x n have the same rank. Then they have the same 
normal form N and there exist non-singular matrices Si, TV, S 2 , T 2 such that S 1 AT 1 = 
N = S 2 BT 2 . Using the inverse Si" 1 and 7Y‘ of Si and T u we obtain 

A = S^'SiATi'T^ 1 = Sr l 'S 2 BT2'Tr 1 = (S^ 1 'S2)B(T 2 'T; 1 ) = S'B'T 
Thus A and B are equivalent. We leave the converse for the reader and state 
Theorem XIV. Two mx« matrices A and B over f are equivalent if and only if they 
have the same rank. 


THE INVERSE OF A NON-SINGULAR MATRIX 

The inverse A -1 , if it exists, of a square matrix A has the property 
A-A~ l = A l 'A = I 

Since the rank of a product of two matrices cannot exceed the rank of either factor 
(see Chapter 13), we have 

Theorem XV. The inverse of a matrix A exists if and only if A is non-singular. 

Let A be non-singular. By Theorem IX' there exist non-singular matrices S and T 
such that S'A'T = /. Then A = S~ l 'T~ l and, by Theorem XII, 

A-i = (S-i.T-i)- 1 = T'S 
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Example 13: 

Using the results of Example 11, we find 


A -1 = T'S = 


1 1-1 
-5 -1 2 


In computing the inverse of a non-singular matrix it will be found simpler to use 
elementary row transformations alone. 


of Example 11 using only elementary r 


[A /] 



2 -1 1 
8 2 0 
9-1 0 


0 0 12-11 
1 0 - 0 2 5 -3 

0 1 0 13-4 



2-1100 

13-401 

25-310 


1 0-7 9 0-2 

013-401 
0 0-1 5 1-2 


1 0 0 -26 -7 12 

0 1 0 11 3 -5 = [I A-»]. 

0 0 1-5-1 2 

See also Problem 14. 


MINIMUM POLYNOMIAL OF A SQUARE MATRIX 

Let A ^0 be an n-square matrix over f. Since A G the set {I, A, A 2 , .. .,A n2 } 

is linearly dependent and there exist scalars a»,a,,a 2r ..a n2 not all 0 such that 

<t>(A) = cioI + di A + a 2 A 2 + • • • + a n2 A n2 = 0 

In this section we shall be concerned with that monic polynomial m(A) G ^[A] of 
minimum degree such that m(A) = 0. Clearly, either m(A) = or m( A) is a proper 
divisor of <j>( A). In either case, m( A) will be called the minimum polynomial of A. 

The most elementary procedure for obtaining the minimum polynomial of A ¥= 0 involves 
the following routine: 

(1) If A = a<>I, ao G f, then m( A) — A — Oo. 

(2) If A ¥= al for all a G f but A 2 = aiA + a 0 I with a 0 , ai G f, then m( A) = A 2 — 

«iA — ao. 

(3) If A 2 aA + bl for all a,b Gf but A 3 = a 2 A 2 + aiA + ad with a 0 , a u a 2 G f, 
then m( A) = A 3 — a 2 A 2 — aiA — a 0 

and so on. 


Example 15: 

Find the minimum polynomial of A 
Since A ¥* a 0 I for all a„ € Q, set 


12 2 
2 12 
2 2 1 


9 8 8 


1 2 2 


1 0 o' 


«! + a„ 2o 1 2 Oj 

8 9 8 

= ®i 

2 12 

+ ®o 

0 10 

= 

2a t o t 4- a 0 2a t 

8 8 9 


2 2 1 


0 0 1 


2oj 2aj a t + 


After checking every entry, we conclude that A 2 = 4A + 6/ and the minimum polynomial is 

See also Problem 15. 
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Example 15 and Problem 15 suggest that the constant term of the minimum polynomial 
of A ¥* 0 is different from zero if and only if A is non-singular. A second procedure for 
computing the inverse of a non-singular matrix follows. 

Example 16: Find the inverse A -1 , given that the minimum polynomial of A 
(see Example 16) is X 2 — 4X — 5. 

Since A 2 - 4A - 5/ = 0 we have, after multiplying by A 1 , A - 4/ - 5A - 1 = 0; 
-3/5 2/5 2/5 

hence, A~» = £(A - 4/) = 2/5 -3/5 2/5 . 

2/5 2/5 -3/5 

SYSTEMS OF LINEAR EQUATIONS 

Let f be a given field and xi, *2 .x„ be indeterminates. By a linear form over f 

in the n indeterminates we shall mean a polynomial of the type 
0*1 + 6x2 + • • ■ + PXn 

in which a,6,...,p6f. Consider now a system of m linear equations 

CluXl + (IJ2X2 + • • • + OinXn = hi 

O21X1 + 022X2 + • • • + 02 nX n = hi (7) 


OmlXi + Om2X2 + ■ • • + OmnX„ = k m 

in which both the coefficients an and the constant terms hi are elements of f. It is to be 
noted that the equality sign in (7) cannot be interpreted as in previous chapters since in 
each equation the right member is in f but the left member is not. Following common 
practice, we write (7) to indicate that elements n,r%,.. .,r n Gf' are sought such that 
when x< is replaced by n, (i = 1,2,.. .,n), the system will consist of true equalities of 
elements of f. Any such set of elements n is called a solution of (7). 

Denote by A = [a#], (i = 1,2, ..., m; j = 1,2, ...,») 

the coefficient matrix of (7) and by S = {£,,the set of row vectors of A. Since 
the are vectors of V n (f), the number of linearly independent vectors in S is r^n. With¬ 
out loss of generality, we can (and will) assume that these r linearly independent vectors 
constitute the first r rows of A since this at most requires the writing of the equations 
of (7) in some different order. 

Suppose now that we have found a vector p — (r t ,r 2 , .. .,r B ) E V n (?') such that 
i l .p = h 1 , t 2 'P = h v .... Z r -p = h T 

Since each | 4 , (i = r +1, r + 2, ..., m) is a linear combination with coefficients in f of the 
r linearly independent vectors of A, it follows that 

$ r+1 ‘P — h r+l , £ r+2 ‘P = h r+2 , ..., £ m -p = h m (8) 

and Xi = n, x 2 = r 2 , ..., x» = r« is a solution of (7) if and only if in (8) each hi is the same 
linear combination of h u h 2 , .. ,,h r as £. is of the set £,,| 2 , that is, if and only if 

®11 O 12 . . . ttln kl 

the row rank of the augmented matrix [A //] = 0,21 0,22 ’ " a2n is also r. 



dml di 1 


dmn ft* 
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We have proved 

Theorem XVI. A system (7) of m linear equations in n unknowns will have a solution if 
and only if the row rank of the coefficient matrix A and of the augmented 
matrix [A H] of the system are equal. 


Suppose A and [A H] have common row rank r<n and that [A H] has been reduced 
to its row canonical form 


1 

0 

0 . 

. 0 

Ci,r+1 

Ci.r + 2 

. . Cin 

fcl 

0 

1 

0 . 

. 0 

C2..-+1 

C2.r+2 

. . C in 

ki 

0 

0 

0 . 

. 1 

Cr,r + 

Cr, r + 2 

. . Cm 

kr 

0 

0 

0 . 

. 0 

0 

0 

. . 0 

0 

0 

0 

0 . 

. 0 

0 

0 

.. 0 

0 


Let arbitrary values s r +i,Sr+2, be assigned to x r +uX r +2, .. then 

X\ — k\ — Ci.r + l'Sr+l — Cl,r + 2*«r + 2 — ••• — Cm’Sn 

Xi = ki — C2,r+l*Sr + l — C2,r + 2*Sr + 2 — ••• — Cin’S„ 

X r = k r ~ Cr,r+l*Sr + l — C r ,r + 2 * Sr + 2 — ••• — Cm ‘ Sn 

are uniquely determined. We have 

Theorem XVI'. In a system (7) in which the common row rank of A and [A H] is r < n, 
certain n-r of the unknowns may be assigned arbitrary values in f 
and then the remaining r unknowns are uniquely determined in terms of 
these. 


Systems of Non-homogeneous Linear Equations 
We call (7) a system of non-homogeneous linear equations over f provided not every 
hi = 0. To discover whether or not such a system has a solution as well as to find the 
solution (solutions), if any, we proceed to reduce the augmented matrix [A H] of the system 
to its row canonical form. The various possibilities are illustrated in the examples below. 


Example 17: 

Consider the system 

We have 


) *j + 2*2 — 3*3 + *4 = 1 

2*4 - *2 + 2*3 - * 4 = 1 
4*j + 3*2 — 4*3 + *4 = 2 




Although this is not the row canonical form, we 
r A = 2 < 3 = 

and the system is incompatible, i.e., has no solution. 


Example 18: 

Consider the system 


{ *i + 2*2 — *3 = — 1 

3*j + 8*2 + 2*3 = 28 
4*, + 9*2 - * 3 = 14 


over Q. 
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We have 

[A H] = 


12-1 -1 
3 8 2 28 

49-1 14 


18 


1 


Here, r A = r [A H] = 3 = the number of unknowns. There is one and only one solution: 
*2 = 3 » *3 = 5 . 

Example 19: 

I *! + X 2 + * 3 + *4 + *5 = 3 

2 *. + 3*2 + 3*3 + * 4 — *5 = 0 - 

Consider the system < , „ _ . „ , over Q. 

-*, + 2* 2 - 5*3 + 2* 4 - * 5 = 1 

[ 3 *, — *2 + 2*3 — 3*4 — 2*5 = —1 

We have 


[A H] = 


2 3 


1 1 
1 -1 


1 


1 


0 -4 -1 -6 -5 


1 

0 

0 

2 

4 

9 


"i 

0 

0 

2 

4 

9 _ 


0 

1 

1 

-1 

-3 

-6 


0 

1 

1 

-1 

-3 

—6 


0 

0 

-7 

6 

9 

22 


0 

0 

-1 

-14 

-25 

-46 


0 

0 

3 

-10 

-17 

-3 4j 


0 

0 

3 

-10 

-17 

-34_ 


: l 

0 

0 

2 

4 

9 1 


1 

0 

0 

2 

4 

9~ 


0 

1 

1 

-1 

-3 

-6 


0 

1 

0 

-15 

-28 

-52 


0 

0 

1 

14 

25 

46 


0 

0 

1 

14 

25 

46 


0 

0 

3 

-10 

-17 

-34 


0 

0 

0 

-52 

-92 

-172 


I 





- 



r 






1 

0 

0 

2 

4 


9 1 


1 

0 

0 

0 

6/13 

31/13 

0 

1 

0 

-15 

-28 

-52 


0 

1 

0 

0 - 

-19/13 

-31/13 

0 

0 

1 

14 

25 

46 


0 

0 

1 

0 

3/13 

-4/13 

0 

0 

0 

1 

23/13 

43/13 


0 

0 

0 

1 

23/13 

43/13 


Here both A and [A H] are of rank 4; the system is compatible, i.e., has one or more solutions. 
Unlike the system of Example 18, the rank is less than the number of unknowns. Now the given 


system is equivalent to 


*!+&* 5 = 31/13 
* 2 -if *5 = -31/13 
*3+^*5 = —4/13 
*4 +f§* 5 = 43/13 


and it is clear that if we assign to * 5 any value 


r S Q then *j = (31 - 6r)/13, * 2 = (-31 + 19r)/13, * 3 = (-4 - 3r)/13, * 4 = (43 - 23r)/13, * 5 = r 
is a solution. For instance, *j = 1, *2 = 2, * 3 = —1, * 4 = —2, * 5 = 3 and x t = 31/13, * 2 = —31/13, 
* 3 = —4/13, * 4 = 43/13, *5 = 0 are particular solutions of the system. 

See also Problems 16-18. 


These examples and problems illustrate 

Theorem XVII. A system of non-homogeneous linear equations over f in n unknowns 
has a solution in f if and only if the rank of its coefficient matrix is 
equal to the rank of its augmented matrix. When the common rank 
is n, the system has a unique solution. When the common rank is r < n, 
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certain n-r of the unknowns may be assigned arbitrary values in f 
and then the remaining r unknowns are uniquely determined in terms 
of these. 

When m = n in system (7), we may proceed as follows: 


(i) Write the system in matrix form 

Oil Ol2 . , 

021 022 • ■ 

. . Oln 

. . 02 n 

"Xl" 

X2 


hi 

h 2 

or, more com- 


Onl On2 . . 

. Onn 

Xn 





pactly, as A’X = H where X is the n x 1 matrix of unknowns and H is the n x 1 
matrix of constant terms. 


(ii) Proceed with the matrix A as in computing A~ l . If, along the way, a row or column 
of zero elements is obtained, A is singular and we must begin anew with the matrix 
[A H] as in the first procedure. However, if A is non-singular with inverse A~ i , then 
A~\A-X) = A~'-H and X = A~'-H. 


Example 20: 

For the system of Example 18, 


8 have from Example 14, A~ l = 


-26 -7 12 
11 3 -5 


-5 -1 

and we obtain the unique solution 


Systems of Homogeneous Linear Equations 

We call (7) a system of homogeneous linear equations provided each hi = 0. Since then 
the rank of the coefficient matrix and the augmented matrix are the same, the system always 
has one or more solutions. If the rank is n, then the trivial solution xi = x 2 = • ■ • = x„ = 0 
is the unique solution; if the rank is r <n, Theorem XVI' assures the existence of non-trivial 
solutions. We have 

Theorem XVIII. A system of homogeneous linear equations over f in n unknowns always 
has the trivial solution x, = x 2 - ■ ■ • = x n = 0. If the rank of the 
coefficient matrix is n, the trivial solution is the only solution; if the rank 
is r <n, certain n~r of the unknowns may be assigned arbitrary values 
in f and then the remaining r unknowns are uniquely determined in 
terms of these. 

{ *i + 2* 2 — * 3 = o 
3* t + 8*2 + 2*3 = 0 over Q. 

4 *! + 9*2 —*3 = 0 

By Example 18, A ~ I 3 . Thus, * t = * 2 = * 3 = 0 is the only solution. 


{ *i + *2 + *3 + *4 = 0 
2*j + 3*2 + 2*3 +* 4 = 0 over Q. 

3*i + 4*2 + 3*3 + 2*4 = 0 


We have 

mi] Ti i i i 
A = 2321-0 1 0 -1 

3432 o 1 0-1 


10 12 
0 10-1 
0 0 0 0 


of rank 2 


Setting * 3 = s, * 4 = t where «,t£ Q, 
*1 = — s — 2t, *2 — t, * 3 = a, * 4 = t. 


obtain the required solutions as: 

See also Problem 19. 
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DETERMINANT OF A SQUARE MATRIX 


To each square matrix A over f there may be associated a unique element a Gf. This 
element a, called the determinant of A is denoted either by det A or |A|. Consider the 
ra-square matrix 

Pan cti 2 ai3 ... a,„1 


021 022 023 . • • 02n 
031 032 033 . . . ®3n 


Onl 0„2 On3 . . . On n 


and a product 


Oijj 02 j 2 a 3 j a ... a n j n 


of n of its elements selected so that one and only one element comes from any row and one 
and only one element comes from any column. Note that the factors in this product have 
been arranged so that the row indices (first subscripts) are in natural order 1,2,3, .... n. 
The sequence of column indices (second subscripts) is some permutation 


P = U v J 2 , J 3 , .. 

of the digits 1,2,3, ..For this permutation, define = +1 or -1 according as p is 
even or odd and form the signed product 


(O) (p Olj, o 2 j 2 asj 3 ... On i n 

The set S„ of all permutations of n symbols contains nl elements; hence, n! distinct signed 
products of the type (a) can be formed. The determinant of A is defined to be the sum of 
these «! signed products (called terms of |A|), i.e., 

(b) |A| = 2 «p ®Ui 0% a *h • • ■ an ’n 

Example 23: 

I ®U «12 1 

(i) = «12®ll a 22 + ( 21 a l2<hl ~ a ll a 22 ~ °12«21 

| «21 ®22 | 

Thus, the determinant of a matrix of order 2 is the product of the diagonal elements of the 
matrix minus the product of the off-diagonal elements. 


(ii) 


“11 “12 “13 

“21 “22 “23 

“31 ®32 “33 


e 123®ll“22“33 + «132“ll®23®32 + «213“l2®21 a 33 

+ e 23l“l2®23“31 + «312“l3“2l“32 + «32l“l3 a 22®31 


“ll“22®33 — ®ll“23®32 — “l2°2l“33 + “l2®23 a 31 + “l3®21®32 “l3 a 22 0 31 

“ll(®22®33 — °23®32) — “l 2 (“21®33 — ®23®3l) + “l3(“ 2 l®32 — °22®3l) 


“22 “23 

“32 “33 


“l2 


“21 

“31 


“23 

“33 


(- 1)1 + * “„ 


“22 “23 

“32 “33 


+ (—D* 


+ (—D 1 


'“13 


“21 “22 
“31 “32 


I “21. ®22 



2 “12 


“21 

“31 


“23 

“33 


called the expansion of the determinant along its first row. It will be left for the reader to 
work out an expansion along each row and along each column. 
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PROPERTIES OF DETERMINANTS 

Throughout this section, A is the n-square matrix whose determinant |A| is given by 
(b) of the preceding section. 

From (b) there follow easily 

Theorem XIX. If every element of a row (column) of a square matrix A is zero, then 
|A| = 0. 

Theorem XX. If A is upper (lower) triangular or is diagonal, then |A| = ana 22 a 3 3 • • • a nn , 
the product of the diagonal elements. 

Theorem XXI. If B is obtained from A by multiplying its ith row (ith column) by a 
non-zero scalar k, then \B\ = &|A|. 

Let us now take a closer look at (a). Since P is a mapping of S = {1,2,3, . ..,«} into 
itself, it may be given (see Chapter 1) as 

p - Ip = J\, 2 P = j 2 , 3 P = j 3 , ..., n P = j n 
With this notation, (a) takes the form 

(a') t p ai.ip a 2 , 2p a 3 , 3 p ... a„, np 

and (b) takes the form 

(&') |^| = ^ tp <*1,1 p <*2,2p <*3,Sp . . . <*n,np 

Since S n is a group, it contains the inverse 

P _1 : iiP’ 1 = 1, hp- 1 = 2 ’ hP~ l = 3 » ■ • Lp~ 1 = » 
of p. Moreover, P and are either both odd or both even. Thus (a) may be written as 

t p-> a J 1 p-«,l 1 Ojjp-iJ 2 <*ljp-i,j 3 . . . <*j n p-i,j„ 

and, after reordering the factors so that the column indices are in natural order, as 

(a") t p -. <*ip-..ia 2 p-, i2 a 3 p-i i8 ... a np -., n 

and (b) as ( b") |A[ = 2 «p-. a lp - K1 o 2p -.. 2 B 3 p-.. 3 ... a np -,, n 

For each square matrix A = [an] define transpose A, denoted by A T , to be the matrix 
obtained by interchanging the rows and columns of A. For example, 


an 

ai2 

ai3 an 

a 2 j 

a3i 

when A = a 2 i 

a 22 

a 23 then A T = a. 2 

a 22 

a 32 

a3i 

a3 2 

a33 an 

a 2 3 

a 3 3 


Let us also write A T — [aj], where a„ = an for all i and j. Then the term of |A T | 
£ p a bj l <* 2 ,j 2 a. 3 j s ... al,j n = t p a )iA a j2 _ 2 3 ... a jn _ n 


is by ( b") a term of |A|. Since this is true for every P G S„, we have proved 
Theorem XXII. If A T is the transpose of the square matrix A, then |A T | — |A|. 

Now let A be any square matrix and B be the matrix obtained by multiplying the ith 
row of A by a non-zero scalar k. In terms of elementary matrices B = Hdk)-A\ and, by 
Theorem XXI, |S| = - Aj = *|A| 
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But \Hi(k)\ = k; hence, \H t (k)-A\ = \Hi(k)\ • \A\. By an independent proof or by Theorem 
XXII, we have also |A.X,(fc)| = |A| • |^)| 

Next, denote by B the matrix obtained from A by interchanging its ith and ith columns 
and denote by r the corresponding transposition (i, j). The effect of r on (o') is to produce 

(a’”) £pTa'i, 1 p T a 2 , 2 p T a 3 , 3 p T ... a n>npT 

hence, |B| = 2 l P r a i,lpr (h.ipr a 3,3pr • • • ®n.npr 

Now u = pr G S n is even when P is odd and is odd when p is even; hence, ta — —t p . More¬ 
over, with r fixed, let p range over S n ; then <, ranges over S„ and so 

\B\ = la 02,2.7 03 , 3,7 ••• On.no- = ~\ A \ 

We have proved 

Theorem XXIII. If B is obtained from A by interchanging any two of its rows (columns), 
then \B\ = — |A|. 

Since in Theorem XXIII B = A • K iS and \K»\ = -1, we have \A‘Ki,\ = |A|*|X«| 
and, by symmetry, \Hu’A\ = \Hij\ • \A\. 

there follows readily, excluding all fields of characteristic two, 

Theorem XXIV. If two rows (columns) of A are identical, then \A\ = 0. 

Finally, let B be obtained from A by adding to its ith row the product of k (a scalar) 
and its jth row. Assuming j < i, 

\B\ = £ e p a U p ... a Up ... a^^Dp (a Up + ka Up ) a i+1 , (i + 1)p ... a n ,„ p 

= S e p a i,lp ®2,2p a 8,3p • • • a n,np 

+ 2 £ p«i.ip • • • °i.ip • • • a ^i.<i-i>p( fca i.jp) a i+i.(i+np- • • a «."P 
= |AJ + 0 = |A| (using ( b ') and Theorems XXI and XXIV) 

We have proved (the case j>i being left for the reader) 

Theorem XXV. If B is obtained from A by adding to its ith row the product of k 
(a scalar) and its jth row, then |B| = |A|. The theorem also holds when 
row is replaced by column throughout. 

Since, in Theorem XXV, B = H t ,(k) ■ A and |H«(fc)} = |/| = 1, we have 
\Hii(k)-A\ = |H«(fc)|* \A\ and \A - R^k)] = \A\ ■ \ K i} (k)\ 

We have now also proved 

Theorem XXVI. If A is an n-square matrix and H(K) is any n-square elementary row 
(column) matrix, then 

|tf-A[ = |H| • |A| and |A -K\ = |A| • |K| 

By Theorem IX', any square matrix A can be expressed as 

( C ) a = Hr 1 • ... Hr 1 • n • k t 1 ... k, t 1 • x:r x 

Then, by repeated applications of Theorem XXVI, we obtain 
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\A\ = \Hr l \-\H^ ... h ; 1 -n-k ; 1 ... K^-KZ 1 ] 

= |flr‘| • \H* l \ • |^3 _1 .. . H7 l • N • K^ 1 .. . K, T 1 • Kr l \ 


= \Hi ‘| • \h 2 x |... itfr‘1 • jiv| • i^r 1 ! • ■ • \k; 1 \ • | k^\ 

If A is non-singular, then N = 1 and |IV| = 1; if A is singular, then one or more of the 
diagonal elements of AT is 0 and \N\ = 0. Thus, 

Theorem XXVII. A square matrix A is non-singular if and only if |A|^0. 
and 

Theorem XXVIII. If A and B are n-square matrices, then |A • Z?j = |A| • |S|. 


EVALUATION OF DETERMINANTS 

Using the result of Example 23 (ii), we have 


1 2 3 

4 5 6 

5 7 8 

= (40 - 42) - 2(32 - 30) + 3(28 - 25) 
= -2-4 + 9 = 3 


(1) 


( 2 ) 


+ (3) 


4 5 

5 7 | 


The most practical procedure for evaluating |A| of order n^3, consists in reducing A 
to triangular form using elementary transformations of the types H^k) and K tj (k) ex¬ 
clusively (they do not disturb the value of |A|) and then applying Theorem XX. If other 
elementary transformations are used, careful records must be kept since the effect of Hu 
or Ku is to change the sign of |A| while that of Hi(k) or Kj(k) is to multiply |A| by k. 

Example 24: An examination of the triangular forms obtained in Example 6 and Problem 8 shows 
that while the diagonal elements are not unique, the product of the diagonal ele¬ 
ments is. From Example 6(a), Page 171, we have 

I 1 2 3 I |l 2 3 I I 1 2 3 I 

|A| = 4 6 6 = 0 -3 -6 = 0 -3 -6 = (l)(-3)(-l) = 3 

I 5 7 8 | | 0 -3 -7 I | 0 0 -1 | 

See also Problem 20. 


Solved Problems 


1. Find the image of $ — (1,2,3,4) under the linear transformation A = 
of Vi (Q) into itself. 


1 -2 
2 4 

0 -1 
1 3 




(t * yi. (• y 2 > i * y 3 » i • y*) 


(See Problem 15, Chapter 13, Page 158.) 


0 4 
1 -2 
5 -1 
2 0 


€[vi vz r 3 y 4 ] 
(9,15,25, -3) 




186 


MATRICES 


[CHAP. 14 


2 . 


Compute A • B and B • A, given 



and B = [4 5 6]. 


A • B = j 


j * [4 5 6] = 

1- 4 1-5 1-6^ 

2- 4 2*5 2 • 6 


4 5 6 

8 10 12 



3*4 3*5 3-6 


12 15 18 


and 


[4 5 6] 


2 

3 


[4 


• 1 + 6 • 2 + 6 • 3] = [32] 


3. When A 


2 -2 

0 1 


and 



1 0-1 
3-2 0 

1 -1 -1 


find A • B. 


2 + 2 1 + 6-2-4 + 2-1 + 2 _ 4 5-2 1 

6 3 + 1 -1 — at — lj [6 4 -1 -4 


4. Show that the linear transformation 
whose image is 0. 


12 2 0 

2 5 3 1 

3 8 4 2 
2 7 13 


in Vt (R) is singular and find a vector 


Using in turn 


H 21 (- 2), H 3 ,(-3), tf 4 i(—2); H 32 (- 2), H 42 <-3), we 


12 2 0 

2 5 3 1 

3 8 4 2 
2 7 13 


1-11 
2-2 2 
3-3 3 




The transformation is singular, of rank 2. 

Designate the equivalent matrices as A,B,C respectively and denote by pi,p 2 ,p s ,P 4 the row 
vectors of A, by p', p', p', p' the row vectors of B, and by p", p", p", p" the row vectors of C. 
Using the moves in order, we have 

p' = p 2 —2p 1( p' = Pa — 3pj, p' = P4 — 2pj 

p" = pi — 2p^, p" = P 2 — 2pj, p^' = P^ — 2p^ 

Now p" = P^-Zp; = (p 3 ^Pi) 2(p2 2pi) = P3-2p 2 + Pi = 0 

while p 4 ' = p 4 - 3p' = (p 4 - 2 Pl ) - 3(p 2 - 2 Pj ) = p 4 - 3p 2 + 4 Pl = 0 

Thus, the image of £ = (1,-2,1,0) is 0; also, the image of u = (4, - 3 , 0 , 1 ) is 0. Show that the 
vectors whose images are 0 fill a subspace of dimension 2 in V 4 (R). 


5. Show that the linear transformation 


A = 


12 3 

2 13 

3 2 1 


is non-singular. 


We find 

1 2 3 
A = 2 13 

3 2 1 



The row vectors of A are linearly independent; the linear transformation A is non-singular. 
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1 2 2 4 2 

6. Find the rank of the linear transformation A = 2 5 3 10 7 of V a (R) into V S (R). 

3 5 7 10 4 


2 5 3 10 7 — 0 1-1 2 3 ~ 

|_3 5 7 10 4J 0—1 1-2-2 

The image vectors are linearly independent; i 


7. From the set {(2,5,0,-3), (3,2,1,2), (1,2,1,0), (5,6,3,2), (1,-2,-1,2)} of vectors 
in V 4 ( R ), select a maximum linearly independent subset. 

The given set is linearly dependent (why?). We find 

2 5 0 —3~j To 1-2 —3~| To 1 -2 -3~ 

3 2 1 2 0 -4 -2 2 0 0 -10 -10 

A = 1210-1210-10 5 6 

5632 0 -4 -2 2 00 -10 -10 

_1 -2 -1 2J |_0 —4 —2 2 0 0 -10 -10 


0 0 -10 -10 0 0 0 0 

|_0 0 -10 -10J |_0 0 0 oj 

From an examination of the moves, it is clear that the first three vectors of A are linear combinations 
of the three linearly independent vectors of B (check this). Thus, {(2, 5,0,-3), (3, 2,1,2), (1,2,1,0)} 
is a maximum linearly independent subset of A. Can you conclude that any three vectors of A are 
necessarily linearly independent? Check your answer by considering the subset (1,2,1,0), (5, 6,3.2) 
( 1 ,- 2 ,- 1 , 2 ). ’ h 


8. By means of elementary column transformations, reduce A = 4 5 6 to upper 

triangular, lower triangular, and diagonal form. I 5 7 8 

Using A 13 (—2/3), K- 23 (-5/6); K l2 ( 1), A M (-1/24), we obtain 

[~1 2 3~| r -1 -1/2 3~j [”-3/2 -5/8 3I 

4 = 4 5 6 r 0 0 6 - 0 —1/4 6 which is upper triangular. 


Using tf 21 (—2), A sl (-3); * S2 (-2) 


1 0 0 

4—3 0 which is lower triangular. 


Using K 2l (—2), K 3l (—3), K 32 (— 2); K ls ( 5), X 23 (—3); A 12 (4/3) we obtain 

100 100 100 

A ~ 4 —3 0 — 4 —3 0 — 0 —3 0 which is diagonal. 

5 -3 -1 0 0 -1 0 0 -1 
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9. Prove: Any non-zero matrix A over f can be reduced by a sequence of elementary 
row transformations to a row canonical matrix (echelon matrix) C having the properties: 

(i) Each of the first r rows of C has at least one non-zero element; the remaining 
rows, if any, consist entirely of zero elements. 

(ii) In the ith row (i = 1,2, ..., r) of C, its first non-zero element is 1, the unity of f. 
Let the column in which this element stands be numbered ju 

(iii) The only non-zero element in the column numbered (i = 1,2,..., r) is the 
element 1 in the ith row. 

(iv) jl<h< •■■ < jr. 

Consider the first non-zero column, numbered of A; 

(а) If a lf ¥> 0, use I/dcqq) to reduce it to 1, if necessary. 

(б) If a Ml — 0 but a PJi ¥= 0, use H lp and proceed as in (a). 

(c) Use transformations of the type H n (k) to obtain zeros elsewhere in the j t column when necessary. 

If non-zero elements of the resulting matrix B occur only in the first row, then B = C; otherwise, 
there is a non-zero element elsewhere in the column numbered j 2 > j\. If b 2 j 2 0, use ZT 2 (6y‘) as 

in (a) and proceed as in (c); if 6 2 j 2 = 0 but 6,^ ¥• 0, use H 2q and proceed as in (a) and (c). 

If non-zero elements of the resulting matrix occur only in the first two rows, we have reached C; 
otherwise, there is a column numbered j 3 > j 2 having non-zero elements elsewhere in the column. 
If ... and so on; ultimately, we must reach C. 


10. Prove: The row rank and column rank of any matrix A over f are equal. 

Consider any mXn matrix and suppose it has row rank r and column rank s. Now a maximum 
linearly independent subset of the column vectors of this matrix consists of s vectors. By inter¬ 
changing columns, if necessary, let it be arranged that the first s columns are linearly independent. 
We leave for the reader to show that such interchanges of columns will neither increase nor decrease 
the row rank of the given matrix. Without loss in generality, we may suppose in 


A 


«n 

®21 


®*1 


®ml 



®1.* + 1 ••• ®ln 
®2.s + l ••• ®2n 


®m.* + l ••• ®m.n 


the first s column vectors yi,y 2 , ...,y« are linearly independent while each of the remaining n-8 
column vectors is some linear combination of these, say, 


y»+t = CtiYi + c t2 y 2 +••*■ + c u y„ (* - 1 , 2, ...,«- a) 


with Cjj G f. Define the following vectors: 

Pi — (®11> a 12.®ls), P2 = (®21>®22> • • >®2s), •■•> Pm — (®ml> ®m2.®ms) 

and O'! = («!!, ®2!, • ■ ■»®* + l,l)« ®2 = (®12>®22> • •->®* + 1.2)> a n ~ (®ln> ®2n> ■ • ■ > ®s + l.n) 


Since the p’s lie in a space V* (f), any 8 +1 of them forms a linearly dependent set. Thus there exist 
scalars 6 1( b 2 , .. ,,b s+1 in f not all 0 such that 

blPl + &2P2 + ‘ ‘ ’ + b,+ lp s +l = (& 1®11 + b 2®21 + ’ ‘ ‘ + 6 *+l®s+l,l> bid 12 + 6 2 a 22 + • • • 

+ b s+1 a s+U2 , ■ ■ bi«is + b 2®2s + • • ’ + b s +i®s + i.s) 

= (£ * ®i» £ ‘ ®2»■••»{* ®«) — f 

where f = (0,0,.. .,0) = 0 is the zero vector of V,(J) and £ = (b lt b 2 , . . .,b s+1 ). Then 
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Then 


£ ' ®i = | • a 2 = 

Consider any one of the remaining <r’s, say, 

"s + k = «2.*+k. ■ • tt s +l,a + k) 

= (Ckl a U + c k2°12 + ' ‘ - + Cfcs®ls> c kl®21 + c k2 a 22 + ' ’ * + C ks a 2s , . . ., 
c kl°s+l.l + Ck2“*+1,2 + • ' • + Ck»“«+l, s ) 
i * a s + k ~ c kl(i * CT l) + c ki(i * a i) + ••• + C ks ({ • <j g ) — 0 

Thus any set of s + 1 rows of A is linearly dependent; hence s — r, that is, 

the column rank of a matrix cannot exceed its row rank. 

To complete the proof, we must show that r ^ a. This may be done in either of two ways: 

(i) Repeat the above argument beginning with A, having its first r rows linearly independent, and 
concluding that its first r + 1 columns are linearly dependent. 

(ii) Consider the transpose of A . 

i a n a 21 

Ojj OJ2 


whose rows are the corresponding columns of A. Now the row rank of A T is s, the column 
rank of A, and the column rank of A T is r, the row rank of A. By the argument above the 
column rank of A T cannot exceed its row rank; i.e. r ^ s. 

In either case we have r = s, as was to be proved. 


11. Reduce A = 


3 2 3 4 5' 

2-1451 
4512-3 


over R to normal form. 


e H l2 (—1) to obtain the element 1 

A = 


the first row and first column; thus 
3 -1 -1 


5 12-3 

Using H 2l (- 2), H 31 (- 4), K 21 (- 3), K 3l ( 1), X 4I (1), * 51 (-4), 


0 -7 
0 -7 


Then using H 32 (-1), K 2 {-l/7), A 32 (-6), K 42 (-7), K 52 ( 7), we have 
A ~ 

and finally, using H 3 (- 1), K 43 (-l), A 53 (-12), 

1 0 0 0 0 
0 10 0 0 
0 0 10 0 
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1 

1 

1 








Reduce A - 


2 

1 

-3 

-6 

over 

R to normal form N 

and find matrices S and T 



3 

3 

1 

2 







such that S • 

•A' 

•r = 

N. 









We find 












1 

0 

0 

0 




10 0 

0 




0 

1 

0 

0 




0 1 0 

0 




0 

0 

1 

0 




0 0 1 

0 




0 

0 

0 

1 




0 0 0 

1 




1 

1 

1 

2 

1 

0 

0 

1 1 1 

2 

1 

0 

0 

h 2 

1 

-3 

-6 

0 

1 

0 

0-1-5 

-10 

-2 

1 

0 

A h = 3 

3 

1 

2 

0 

0 

l -* 

0 0-2 

-4 

-3 

0 

l 



1 

0 

-1 

1 

-1 

0 

-2 

0 


1 

0 

-1 

1 

-1 

0 

-2 

0 




0 

0 

1 

0 


0 

0 

1 

0 




0 

0 

0 

1 


0 

0 

0 

1 




1 

0 

0 

0 

1 0 

0 1 

0 

0 

0 

10 0 



0 

-1 

-5 

-10 

-2 1 

0 0 

1 

5 

10 

2-1 0 


-* 

0 

0 

-2 

-4 

-3 0 

1 -» 0 

0 

1 

2 

3/2 0 -1/2 



1 

0 

-1 

1 

-1 

0 

-2 

0 



1 

0 

-1 

1 

-1 

0 

0 

0 



0 

0 

1 

0 



0 

0 

1 

-2 



0 

0 

0 

1 



0 

0 

0 

1 



1 

0 

0 

0 

1 

0 0 

1 

0 

0 

0 10 0 



0 

1 

0 

0 

-11/2 - 

-1 5/2 

0 

1 

0 

0 -11/2 1 5/2 T 



► 0 

0 

1 

2 

3/2 

0 -1/2 -> 

0 

0 

1 

0 3/2 0 -1/2 = N S 


Hence S = 


10 0 
-11/2 -1 5/2 

3/2 0 -1/2 


and T = 


1 - 1-1 0 
0 10 0 

0 0 1-2 

0 0 0 1 


13. Prove: The inverse of the product of two matrices A and B, each of which has an 
inverse, is the product of the inverses in reverse order, that is, 

(A'B)- 1 = B~ l • A~ l 


By definition, (A • B)~ l ‘(A'B) = (A • B)(A • B) 1 =/. Now 


(B-i-A-i) • (A*B) = £-i(A-i’A)B = B~ l • 1 • B = B~ l • B = / 
and (A • B)(B~ l • A~ l ) = A(B-B~ l )A~ l - A • A -1 = I 

Since (A >B)~ l is unique (see Problem 33), we have (A •B)~ 1 = B~ l • A -1 . 


14. Compute the inverse of A = 


1 2 4 
3 10 

2 2 1 


over 7/(5). 


We have 

[A / 3 ] = 


12 4 10 0 
3 10 0 10 
2 2 1 0 0 1 


12 4 10 0 
0 0 3 2 1 0 
0 3 3 3 0 1 


12 4 10 0 
0 3 3 3 0 1 
0 0 3 2 1 0 


12 4 10 0 
0 1110 2 
0 0 1 4 2 0 


1 0 2 4 0 1 
0 1110 2 
0 0 1 4 2 0 


10 0 111 
0 1 0 2 3 2 
0 0 1 4 2 0 


and 


A-i 


111 
2 3 2 
4 2 0 
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"l 1 r 

15. Find the minimum polynomial of A = 0 2 0 over R. 

_! 1 1 


Clearly, A # a 0 / for all o 0 G R. Set 



which is impossible. Next, set 



T 2a 2 + a x + a 0 = 4 

From <j 4a 2 + a t = 12, we obtain a 0 = 0, o, = —4, a 2 = 4. After checking for every element 

[2a 2 + a, =4 

of A 3 and not before, we conclude m(\) = X s — 4X 2 + 4X. 

1 2*1 + 2*2 + 3*3 + *4=1 
3*1 - *2 + *3 + 3*4 = 2 

-2*i + 3*2 - *3 - 2*4 = 4 over R. 

*i + 5*2 + 3*3 — 3*4 = 2 

2*i + 7*2 + 3*3 — 2*4 = 8 

We have 

2 2 3 1 1~1 I - 1 5 3 -3 2~1 Tl 5 3 -3 2~ 

3 -1 1 3 2 3 -1 1 3 2 0 -16 -8 12 -4 

[A H] = -2 3 -1 -2 4 ~ -2 3 -1 -2 4 - 0 13 5 -8 8 

153 -3 2 2231 1 0 -8 -3 7 -3 

2 7 3 -2 8_J [_ 2 7 3 -2 8_ [_0 —3 —3 4 4_ 

“1 5 3 -3 2 “I Tl 0 1/2 3/4 3/4~I |“l 0 1/2 3/4 3/4 _ 

0 1 1/2 -3/4 1/4 0 1 1/2 -3/4 1/4 0 1 1/2 -3/4 1/4 

-0 13 5 -8 8 -00 -3/2 7/4 19/4 - 0 0 1 1 -1 

0 -8 -3 7 -3 0 0 1 1 -1 0 0 -3/2 7/4 19/4 

0 -3 -3 4 4 J [_0 0 -3/2 7/4 19/4J |^0 0 0 0 0 _ 

"lOO 1/4 5/4~| Tl 0 0 1/4 5/4 I |~1 0 0 0 1~ 

0 1 0 -5/4 3/4 0 1 0 -5/4 3/4 0 1 0 0 2 

-001 1 -1 - 001 1 -1 - 0010 -2 

0 0 0 13/4 13/4 0001 1 00011 

0 0 0 0 0 |_0 0 0 0 0 _0 0 0 0 0_ 

Both A and [A H] have rank 4, the number of unknowns. There is one and only one solution: 
= 1, x 2 — 2, * 3 = —2, x 4 = 1. 

Note. The first move in the reduction was H 14 . Its purpose, to obtain the element 1 in the first 
row and column, could also be realized by the use of 
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17. Reduce 


3 2 1 
6 5 4 

4 2 5 


over 1/(7) to normal form. 


Using H 2i (l), H al (3); H 12 (4), tf 32 (3); H 3 (3); tf la (l), H u (5), 


3 2 1 
6 5 4 

4 2 5 


13 5 
6 5 4 
4 2 5 


10 6 
0 12 
0 0 5 


10 6 
0 12 
0 0 1 


1 0 0 
0 10 
0 0 1 


18. Find all solutions, if any, of the system 


)- 2x 2 + X 3 + 3 x 4 = 4 
(- x 2 4- 3xa 4- 2 x 4 = 1 
2x 2 4- X 3 4- X 4 = 3 
t- x 2 4- 3 x 3 + 4 x 4 = 2 


over 1/(5). 


12 13 

2 13 2 
0 2 11 

3 13 4 


12 13 
0 2 11 
0 2 11 
0 0 0 0 


12 13 
0 13 3 
0 2 11 
0 0 0 0 


10 0 2 
0 13 3 
0 0 0 0 
0 0 0 0 


Here, r A = r [A H] = 2; the system is compatible. Setting x 3 = s and x 4 = t, with g,t G 1/(5), 
solutions are given by 

*! = 1 4 3t, x 2 = 4 4 2s 4 2t, 


Since 1/(5) is a finite field, there is only 


* 4 = t 

1 finite number (find it) of solutions. 


19. Solve the system 


2xi + x 2 4- Xs = 0 
xi 4- x 3 = 0 over 7/(3). 
2x 2 4- xs = 0 


We have 


Then assigning 


A 


211 122 122 101 

101 ~ 101 ~ 012 ~ 012 
021 021 021 000 


x 3 = 8 € 1/(3), we obtain * 1 = 2», x 2 = x s = s as the required solution. 


20. With each matrix over Q, evaluate: 

I 0 1 -3 I 1-1 1 -3 I 1-1 

(o) 2 5 4 = -3 5 4 = -3 

|-3 2 -2 I 1-5 2 -2 I |-5 

K 12 (—l) is used to replace a u = 0 by a 
using K l2 , then 

I 0 1 -3 I I 1 0 -3 I 

2 5 4 = — 6 2 4 = 

|-3 2 -2 I | 2 -3 -2 I 

An alternate evaluation is as follows: 

I 0 1 -3 I I 0 1 0 I 

2 5 4 = 2 5 19 

| -3 2 -2 | I -3 2 4 | 


0 0 I I -1 0 0 I 

2 13 = 2 5 0 = -65 

-3 13 I I -5 -3 13 I 

non-zero element. The same result can be obtained by 

I 1 0 0 I | 1 0 0 I 

-I 5 2 19 = - 5 2 0 = -65 

I 2 -3 4 I | 2 -3 65/2 | 


= -(1) _g ^ = -(8 4 57) = -65 
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(b) 


2 3-2 

3 -2 1 

3 2 3 


-2 4 0 


-1 3-2 4 

5-212 
12 3 4 

-6 4 0 5 


-10 0 0 
5 13 -9 22 

15 18 

-6 -14 12 -19 


-10 0 0 

-1-1 3 3 

15 18 

-6 -14 12 -19 

(-1)(-1)(16)(—143/8) 


-10 0 0 
- 1-1 0 0 
1 5 16 23 

-6 -14 -30 -61 
-286 


-10 0 0 

- 1-10 0 
1 5 16 0 

-6 -14 -30 -143/8 


Supplementary Problems 


21. Given A 


1 0 2 
0 3 1 
4 2 0 


12 3 

13 4 , C = 

14 3 


-7 6 -1 

1 0-1 
1-2 1 


over Q, compute: 



2 2 5 


3 10 9 


[ -5 2 1 

(a) A + B = 

16 5 

5 6 3 

(c) A-B = 

4 13 15 

6 14 20 

(«) A-C = 

4-2-2 

,-26 24 -6 


(6) 3A = 

3 0 6 

0 9 3 

(d) B‘C ~ 

-2 0 0 

0-2 0 

(/) A 2 = A> A = 

9 4 2 

4 11 3 


12 6 0 


0 0 -2j 


4 6 10 


22. For the arrays of Problem 21, verify: (a) (A 4- B)C = AC + BC, (b) (A • B)C = A(B • Q. 

23. For A = [ay], (t = 1,2,3; j = 1,2,3), compute / 3 ’A and A • I 3 (also 0 3 • A and A • 0 3 ) to verify: 
In the set % of all n-square matrices over f, the zero matrix and the identity matrix commute with 
all elements of %. 


24. Show that the set of all matrices of the form 

^s(Q). 


0 

0 



, where a, b, 


S Q, is a subalgebra of 


25. Show that the set of all matrices of the form 

(R). 


a b c 
0 a + e 0 
c b a 


where 


a,b,cG R, is a subalgebra of 


26. Find the dimension of the vector space spanned by each set of vectors over Q. Select a basis for each. 

(а) {(1,4,2, 4), (1,3,1,2), (0,1,1,2), (3,8,2,4)} 

(б) {(1,2,3,4,5), (5, 4,3,2,1), (1,0,1,0,1), (3,2, -1, -2, -5)} 

(c) {(1,1,0,-1,1), (1,0,1,1,-1), (0,1,0,1,0), (1,0,0,1,1), (1,-1,0,1,1)} 

Ans. (a) 2, (6) 3, (c) 4 


12 3 0 

2 4 3 1 

3 2 14 
2 0 4 2 


27. Show that the linear transformation A = 
vector whose image is 0. 


of V 4 (R) into itself is singular and find a 
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Prove: The 3-square matrices /, H n , H 13 , H 23 , H 12 • H 13l H 12 • H 23 under multiplication form i 
group isomorphic to the symmetric group on 3 letters. 

Prove: Under multiplication the set of all non-singular n-square diagonal matrices over f is i 
commutative group. 


30. Reduce each of the following matrices ove 
12 12 
13 2 2 

2 4 3 4 

3 7 4 6 


v equivalent canonical matrix: 


3 3 12 



3 

4 

6 

6 

7 

8 

(e) 

4 

6 

6 

7 

8 

9 

5 

6 

7 

8 

9 

8 


10 

11 

12 

13 

14 

16 


2 -2 
5 -4 



- 0 o o" 


10 0 2 


1 0 -1 -2 -3 0 

I" 1 ° - 7 ] (6) 

0 10 0 

(c) 

0 10 0 

(d) h («) 

0 1 2 3 4 0 

L° 1 2 J 



0 0 10 


0 0 0 0 0 1 


_0 0 1 2J 


0 0 0 0 


0 0 0 0 0 0 


31. In Example 11, Page 176, use H 2 ($), H a 2 (-l), tf 23 (-5), K a ( 2) 
1-2 1 



- “1 
1 0 0 


1 -2 2 

and obtain S = 

11 3 -6 

-6/2 -1/2 1 

and T = 

0 10 

0 0 2 


10 0 10 0 

0 2 6 -3 1 0 

013-401 

to show that the non-singular matrices S and T such that S • A ■ T = / are not unique. 

r er R to normal form N and compute matrices S and T such that 


0 4 


32. Reduce A = 

S'A • T = N. 

33. Prove that if A is non-singular, its inverse A -1 is unique. 

Hint. Assume A • B — C • A = I and consider (C • A)B = C(A • B). 

34. Prove: If A is non-singular, then A • B — A • C implies B = C. 

35. Show that if the non-singular matrices A and B commute, so also do (a) A -1 and B, (6) A and B~ l , 
(c) A -1 and B~K 

Hint, (o) A _1 (A • B)A~ l = A~HB • A)A~K 

36. Find the inverse of: 


13 3 

14 3 
13 4 

12 3 

2 4 6 

3 6 6 


12 3 

13 3 
2 4 3 


3 4 2 7 
2 3 3 2 
6 7 3 9 
2 3 2 3 
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7-3-3 


1-3 2 


3-6 3 

Ana. (a) ^ 

-1 1 0 
-1 0 lj 

, (6) 

-3 3 -1 

2-1 0 

, M i 

-3 3 0 

2 0-1 


«*> A 


37. Find the inverse of A = 


0 2 1 
2 10 
112 


1 3 -5 


3-1 5 

-5 5 -5j 

, (e) i 


-1 11 7 —2( 

-1 -7 -3 1< 


W * 


2 16 -6 4 

22 41 -30 -1 

-10 -44 30 -2 

4 -13 6 -1 


10 1 
1 1 1 
2 1 1 


r 1/(3). Does A have an inverse over 7/(5) ? 


of (a) 

0 10 

0 0 1 

, (6) 

2 0 0 

0 10 

, (c) 

1 1 2 

1 1 2 

, (d) 

2 11 

12 1 


1 2-1 


0 0 1 


112 


112 


Ana. (a) X 8 + X 2 - 2X - 1, (6) X 2 - 3X + 2, (e) X 2 - 4X, (d) X 2 - 5X + 4 
39. Find the inverse of each of the matrices ( a),(b),(d) of Problem 36, using its minimum polynomial. 

m-singular matrix A and obtain a 


40. Suppose X 8 + oX 2 + 6X 
contradiction. 


the minimum polynomial of i 


41. Prove: Theorems XIX, XX and XXI, Page 183. 

(Hint. If the tth and jth rows of A i 


Prove Theorem XXIV 
Theorem XXVIII. 


s identical, |A|. = \H tj \ • |A|) and 


43. Evaluate: 



1 1 2 3 

1 -7 6 

-1 1 

(“) 1 3 4 

M l o 

-1 (e) 

| 1 4 3 

1 1 ~2 

1 1 

1 1 0 2 

j 1 2 -1 

1 1 

(6) 0 3 1 

(d) 3 2 

4 (f) 

1 4 2 0 

1-1 0 

3 1 


1116 
2 4 16 
4 12 9 
2 4 2 7 
3 5 7 2 
2 4 11 
-2 0 0 0 


Ana. (a) -2, (b) -26, (c) 4, (d) 27, (e) 41, (f) 156 


44. Evaluate: (a) 

X — 1 2 3 

1 X — 3 4 

. w 1 

X — 2 -1 -4 1 

-1 X — 3 -5 

\ 1 4 X — 3 | 

Hint. Expand along the first row or 

! 1-4-5 X —6 | 

first column. 


Ana. (a) X s - 7X 2 - 6X + 42, (b) X 8 - 11X 2 - 6X + 28 

Denote the row vectors of A = [o (j ], (i,j = 1,2,3), by Pl> p 2 ,p 3 . 

(a) Pl X P2 (see Problem 13, Chapter 13, Page 157) can be found a 


Show that 

follows: Write the array 


°22 °23 “21 “22 


and strike out the first column. Then 

— (i::M 

(b) |A| — Pj • (p 2 X p 3 ) — —p 2 • (pj X Pg) = p* • (px X Pg). 


“n “12 

I “21 “22 I 
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46. Show that the set of linear forms 


(o) 


/, = a n x l + o 12 * 2 + • • • + o ln * n 

f 2 = Ojj*, + 022*2 + • • • + a 2 n x n 


[/« = “ml*l + “m2*2 + ' * ' + “«A 
s linearly dependent if and only if the coefficient matrix 

A = [o y ], (* = 1,2,..., m; > = 1,2- n) 

s of rank r < m. Thus (o) is necessarily linearly dependent if m> n. 


47. Find all solutions of: 

(a) x x — 2 x 2 + 3* 3 — 5* 4 = 1 

2*| + * 2 + 5*3 +*4 = 5 


3*j + 6*2 — 2*3 + *4 — 8 
2* t + 2*2 + 2*3 - 3* 4 = 2 


[_ 2*! + 5*2 — 2*3 = 

{ *, + *2 + 2*3 +*4 = 5 

2*, + 3*2 - *3 - 2*4 = 2 
4*. + 5*2 + 3*3 =7 


, + *2 + *3 = 4 

(c) -{ 2*, + 5*2 - 2*3 = 3 
, + 7*2 - 7*3 = 5 


!j + *2 - 2*3 + *4 + 3*5 = 1 

(/) \ 2*4 - *2 + 2*3 + 2*4 + 6*5 = 2 (a) 

+ 2*2 - 4*3 - 3*4 — 9*5 = 3 


*, + 3*2 + *3 + *4 + 2*5 = 0 

2*J + 5*2 — 3*3 + 2*4 - *5 = 3 

-*J + *2 + 2*3 - *4 + *5 = 5 

3*4 + *2 + *3 - 2*4 + 3*5 = 0 


Ans. (a) *4 = 1 + 2r — 3s + 5t, x 2 = r, x 3 — * 4 — t 

(6) *4 = 17/3 - 7r/3, * 2 = -5/3 + 4r/3, * 3 = r 
(d) *i = 2, * 2 = 1/5, * 3 = 0, * 4 = 4/5 
(/) *i = 1, * 2 = 2r, * 3 = r, *4 = —36, *5 = 6 

(ff) *4 = -11/5 - 4r/5, *2 = 2, * 3 = -1 - r, * 4 = -14/5 - r/5, * 5 = r 
48. (o) Show that the set Af 2 = {A,B, ...} of all matrices over Q of order 2 is isomorphic to the vector 


space V 4 (Q). Hint. Use A 


(6) Show that I \4 = 


O44 *12 

*24 0-22 


(041,042,024, ®22>* See Prob. 3, Chap. 10, Page 108. 


0 0 
10 


0 0] 

01 1S a 


basis for the vector 


(c) Prove: A commutes with . 

Hint. B = 644/44 + &12A2 + 624/21 + 622/22- 


* »1: 

~y * 


e /? I . Show (o) S 2 is a vector space o 


Hint. In (b) show that the mapping S 2 -» C 


x v -* i 

-y * 


r R, (6) S 2 is a field, 
an isomorphism. 


50. Show that the set = {(<?i + q 2 i + 93 i + <l* k ) ■ 9i> «2.9s> 94 e with addition and multiplication 
defined in Problem 27, Chapter 11, Page 123, is isomorphic to the set 

9i 92 93 94 

-92 9i —94 93 

-93 94 9i -92 

Is S 4 a field? I I “ 94 ” 93 92 91 
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51. Prove: If £i,£ 2 ,.. are m<n linearly independent vectors of V n (f), then the p vectors 

’i = *jlfl + s }2(2 + ' • ' + Bjmim , O' = 1, 2.p) 

are linearly dependent if p > to or, when p - to, if [*„] is of rank r < p. 


52. Prove: If | 1( £ 2 , ...,| n are linearly independent vectors of V n (f), then the » vectors 

ij = «ji£i + a, 2 £2 + • • • + , O' = If 2,....») 

are linearly independent if and only if |o y | ¥= 0. 

53. Verify: The ring T 2 = -|| a : a, 6, e G fi j. has the subrings 


{[» -1: XG«1, 

{[* y : x.yGR 1, 

and J ® 

* : x,y G R l 

U° °J j 

U° °J J 

ll° 

J 


as its proper ideals. Write the homomorphism which determines each as an ideal. (See Theorem VI, 
Chapter 10, Page 105.) 


54. Prove: (A + B)T = A T + B T and (A • B)* = B T • A T when A and B are n-square matrices o 


55. Consider the n-vectors X and Y as 1 X » matrices and verify: 

X • Y = X • Y T = Y • XT 


56. (a) Show that the set of 4-square matrices 

= {/, H 12 , H l3 , H 14 , H 2S , H m , Hu, H 12 • Hi 3 , Hi 2 • H 2 3, H l2 • tf 14 , 12 • 24 , 13 • H u , 
Hu'H , 3 , H 23 -H 24 , H 24 'H 23 , H l2 • ^34, Hi 3 'H m , H 14 • H 23 , H 12 -H 13 -H l4 , 
Hi 3 -Hi 2 -Hu, Hi 3 -Hu-Hi 2 , Hi 4 -H 12 -Hi 3 , H 14 .H, 3 -H 12 ) 
is a multiplicative group. //»«. Show that the mapping: 

H ir H ik -*(ijk), H tr H kl ->(ij)(kQ, H ir H ik ‘H u ->(ijkl) 
of =3/ into S„ is an isomorphism. 

(6) Show that the subset {/, H i3 , H 24 , H l2 • H 13 -H 24 , tf 14 • f/ 23 , H 12 'H 13 -H U , H u • ff 18 • H l2 > 

of is a group isomorphic to the octic group of a square. (In Fig. 9-1, Page 92) replace the 
designations 1,2, 3, 4 of the vertices by (1,0,0,0), (0,1,0,0), (0,0,1,0), (0,0,0,1) respectively.) 

57. Show that the set of 2-square matrices 



is a multiplicative group isomorphic to the octic group of a square. 

Hint. Place the square of Fig. 9-1, Page 92, in a rectangular coordinate system so that the vertices 
1,2, 3,4 have coordinates (1,-1), (1,1), (-1,1), (-1,-1) respectively. 


58. Let S spanned by (1,0,1,-1), (1,0,2,3), (3,0,2,-1), (1,0,-2,-7) and T spanned by (2,1,3,2), 
(0,4,-1,0), (2,3,-4,2), (2,4,-1,2) be subspaces of V 4 (Q). Find bases for S, T, S n T, and S+T. 
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Matrix Polynomials 


MATRICES WITH POLYNOMIAL ELEMENTS 


Let f[X] be the polynomial domain consisting of all polynomials in A with coefficients 
in f. Anmxji matrix over f [A], that is, one whose elements are polynomials of f[X], 


A (A) = [o«j(X)] 


an(A) a l2 (\) ... am(A) 

®2l(A) <l22(A) . . . tt2n(A) 


fl*nl(A) On>2(A) . . . ttmn(A) 


is called a A-matrix (read, lambda matrix). 


Since f c f [Aj, the set of all m x n matrices over f is a subset of the set of all m X n 
A-matrices over f [A]. It is to be expected then that much of Chapter 14 holds here with, 
at most, minor changes. For example, with addition and multiplication defined on the 
set of all ?i-square A-matrices over f'[X] precisely as on the set of all n-square matrices 
over f, we find readily that the former set is also a non-commutative ring with unity 

On the other hand, although A (A) = ^ J is non-singular, i.e. |A(A)| = A(A + 1) * 0, 

A(A) does not have an inverse over y[A]. The reason, of course, is that generally «(A) does 
not have a multiplicative inverse in y[A]. Thus it is impossible to extend the notion of 


elementary transformations on A-matrices so that, for instance, 


A (A) = 


A 0 
0 A +1 


1 0 
0 1 . ’ 


ELEMENTARY TRANSFORMATIONS 

The elementary transformations on A-matrices are defined as follows: 

The interchange of the ith and jth rows, denoted by i/«; the interchange of the 
ith and ;th columns, denoted by Kq. 

The multiplication of the ith row by a non-zero element k G f, denoted by 
Hi(k)\ the multiplication of the ith column by a non-zero element k&f, denoted 
by Ki(k). 

The addition to the ith row of the product of /(A) e f[x\ and the jth row, 
denoted by #«(/(A)); the addition to the ith column of the product of /(A) G f [A] 
and the jth column, denoted by K fJ {f(\)). 

(Note that the first two transformations are identical with those of Chapter 14 while the 
third permits all elements of f [A] as multipliers.) 

An elementary transformation and the elementary matrix obtained by performing that 
transformation on I will again be denoted by the same symbol. Also, a row transformation 
on A (A) is effected by multiplying it on the left by the appropriate H, and a column trans¬ 
formation is effected by multiplying it on the right by the appropriate K. 
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Paralleling the results of Chapter 14, we state: 

Every elementary matrix is non-singular. 

The determinant of every elementary matrix is an element of f. 

Every elementary matrix has an inverse which, in turn, is an elementary matrix. 
Two mxn A-matrices A(A) and 5(A) are called equivalent if one can be obtained 
from the other by a sequence of elementary row and column transformations, i.e. 
if there exist matrices S(A) = H s ... H 2 'H i and T( A) = Ki'K 2 ... K t such that 

5(A) ’ A(\) • T(X) = 5(A) 

The row (column) rank of a A-matrix is the number of linearly independent 
rows (columns) of the matrix. The rank of a A-matrix is its row (column) rank. 
Equivalent A-matrices have the same rank. The converse is not true. 


NORMAL FORM OF A A-MATRIX 

Corresponding to Theorem IX', Chapter 14, Page 174, there is 
Theorem I. Every mxn A-matrix A (A) over f [A] of rank r can be reduced by elementary 
transformations to a canonical form (normal form) 

7,(A) 0 0 ... 0 0 0 ... 0 

0 / 2 ( A) 0 ... 0 0 0 ... 0 

W(A) = 0 0 0 ... 0 / r (A) 0 ... 0 

0 0 0 ... 0 0 0 ... 0 


L 0 0 0 ... 0 0 0 ... o 

in which f x (A), f 2 ( A), ..., / r (A) are monic polynomials in f [A] and /.(A) divides 
/ j+1 (A) for i= 1, 2, ...,r-l. 

We sha11 not P rove this theorem nor that the normal form of a given A (A) is unique 
(The proof of the theorem consists in showing how to reach N{ A) for any given A(A)- 
uniqueness requires further study of determinants.) A simple procedure for obtaining 
the normal form is illustrated in the example and problems below. 

Example 1: r 

* + 3 A + l A + 2 

Reduce A( A) = 2A 2 + A-3 A 2 + A-l 2A 2 - 2 

A 3 + A 2 + 6A + 3 2A 2 + 2A +1 A* + A* + 5A + 2 

over R( A) to normal form. 

The greatest common divisor of the elements of A (A) is 1; take /,(A) = 1. Now use K (-1) 
to replace a n (A) by /,(A), and then by appropriate row and column transformation obtain an 
equivalent matrix whose first row and first column have zero elements except for the common 
element /i (A); thus, 


1 A + l A + 2 

A-l A 2 + A-l 2A 2 — 2 

A + l 2A 2 + 2A + 1 A 3 + A 2 + 5A + 2 
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Consider now the submatrix 


X 2 -X 
X s + 2X ' 


The greatest common divisor of its elements is X; 


set / 2 (X) = X. Since / 2 (X) occupies the position of & 22 (X) in B(\), we proceed to clear the second 
row and second column of non-zero elements, except of course for the common element / 2 (X), 
and obtain 


A(X) ~ 


0 

0 


0 

X 

X 2 


0 

x 2 -x 

X 3 + 2X 


10 0 

0 X X 2 - X 
0 0 X 2 + 2X 


10 0 

OX 0 

0 0 X 2 + 2X 


N(X) 


since X 2 + 2X is monic. 


See also Problems 1-3. 


The non-zero elements of N(X), the normal form of A(Al, are called invariant factors 
of A (A). Under the assumption that the normal form of a A-matrix is unique, we have 
Theorem II. Two m x n A-matrices over jF[A] are equivalent if and only if they have the 
same invariant factors. 


POLYNOMIALS WITH MATRIX COEFFICIENTS 

In the remainder of this chapter we shall restrict our attention to n-square A-matrices 
over y'[A]. Let A (A) be such a matrix and suppose the maximum degree of all polynomial 
elements ao(A) of A (A) is p. By the addition, when necessary of terms with zero coefficients, 
A(A) can be written so each of its elements has p + 1 terms. Then A(A) can be written 
as a polynomial of degree p in A with n-square matrices A* over f as coefficients, called 
a matrix polynomial of degree p in A. 

Example 2: 

For the X-matrix A(X) of Example 1, we have 


A(X) 


X+3 X+l 

2X 2 + X — 3 X 2 + X — 1 
X 3 + X 2 + 6X + 3 2X 2 + 2X + 1 


X + 2 
2X 2 — 2 


X s + X 2 + 5X + 2 


OX 3 + OX 2 + X + 3 
OX 3 + 2X 2 + X - 3 
X s + X 2 + 6X + 3 

0 0 0 
0 0 0 
10 1 


ox* + ox 2 + x +1 

OX 3 + X 2 + X -1 
OX 3 + 2X 2 + 2X + 1 

1 1 1 
110 
6 2 5 


OX 3 + OX 2 + X + 2 
OX 3 + 2X 2 + OX - 2 
X 3 + X 2 + 5X + 2 

3 1 

X + -3 -1 - 

3 1 


0 0 0 
2 12 
12 1 


Consider now the n-square A-matrices or matrix polynomials 

A (A) = A p A p 4- Ap-i A p_1 + ••• + AiA + A 0 (1) 

and B{ A) = B q \ 9 + B q - ,A’-‘ + ••• + B t \ + B 0 (2) 

The two A-matrices (matrix polynomials) are said to be equal when p = q and A t - Bi 
for i = 0,1 , 2 , ..., p. 

The sum A(A) + B( A) is a A-matrix (matrix polynomial) obtained by adding correspond¬ 
ing elements (terms) of the A-matrices (matrix polynomials). If p>q, its degree is p; 
if p = q, its degree is at most p. 

The product A(A)-J5(A) is a A-matrix (matrix polynomial) of degree at most p + q. 
If either A(A) or B( A) is non-singular (i.e., either |A(A)| ^0 or |£(A)| ¥* 0) then both 
A (A) • B{ A) and B( A) • A (A) are of degree p + q. Since, in general, matrices do not com¬ 
mute, we shall expect A(A) • B( A) ¥= B(k) • A(A). 
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The equality in ( 1) is not disturbed if A is replaced throughout by any kGf'. For 
example, 

A(k) = A p k v + A p -i k p ~ l + • • • + Aik + A 0 

When, however, X is replaced by an n-square matrix C over f, we obtain two results 
which are usually different 

A r (C) = A P C V + Ap-i C p_1 + ••• + AiC + A 0 (5) 

and A L (C) - C P A P + C p ~ l A p -1 + ••• + CAi + A 0 (S’) 

called, respectively, the right and left functional values of A (A.) when X = C. 

Example 3: 


When A(\) = 

then A„(0 = 

and A l (C) = 



DIVISION ALGORITHM 

The division algorithm for polynomials a(x),j8(x) in x over a non-commutative ring % 
with unity was given in Theorem II, Chapter 12, Page 126. It was assumed there that the 
divisor j3(x ) was monic. For a non-monic divisor, that is, a divisor /3(x) whose leading 
coefficient is 6„^1, the theorem holds only if 6“‘ € %. 

For the coefficient ring considered here, every non-singular matrix A has an inverse 
over f; thus the algorithm may be stated as 

If A (A) and B(X) are matrix polynomials (1) and (2) and if B q is non-singular, 
then there exist unique matrix polynomials Qi(A),iei(A); Q 2 (A), R 2 (X) G f[X], where 
#i(A) and R 2 ( A) are either zero or of degree less than that of B( A), such that 

A(A) = Q,(A)-R(a) + Ri( X) (1>) 

and A(A) = B(X) • Q 2 (x) + R 2 ( A) (£') 


If in (4) R i(A) = 0, 5(A) is called a right divisor of A(A); if in (4') R 2 ( A) = 0, 5(A) is 
called a left divisor of A(A). 


Example 4: 


Given 



find Qi(\), Ri(\); Q 2 (\), R 2 (\) such that 


(o) A(\) = Qj(\) >B(\) + R,(X), ( b) A(X) = B(\) • Q 2 (X) + R 2 (\) 


Here B x = 
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(a) We compute 

A(X) - A 3 B~ l X 2 B(X) 


3 5 
1 0 


0 4 

-1 1 


i T2 2~| 0 4 

c(\) - c 2 b;'\B(\) = 12 x + _i i 


D(\) - D^-’BM = 
Then Qy(\) = (A 3 X 2 + C 2 X + D 1 )B~ 1 = 


(b) We compute 

A(\) - B(\)B~ l A s \ 2 = 


0 0 
-4 2 


1 1 
0 1 


= C(\) 

D(\) 


0 2 
3-2 


X 2 + X x + 2 
3 X — 2 


, 3 M 

E{\) — 3(X)3| * E 2 X — 


3 5 
1 0 


0 4 
1 2 


0 4 
-1 1 


F(X) - B(X)B;'F l = ^ 5 | 

r 

Then Q 2 (X) = By 1 (A 3 X 2 4- E 2 X + F t ) 


:j -™ 

| = m 

« 2 (X) 


10 
-1 0 


3 2 

0-1 


0 4 
1-2 





X 2 + 3X 

2X + 4 





-X 2 + l 

—X — 2 

See Problem 5. 

For the n-square matrix B 

= [M 

over f, 

define its characteristic matrix as 


rx-6« 

i — 6l2 

-&13 . 

. . -6l» 



—bn 

A — b 22 

-623 .. 

— b 2 n 


Al-B = 

-bn 

—bs2 

A — 633 

. . 63 n 



_ —bni 

~bn2 

~bn 3 . 

.. A — 6„„_ 


With A (A) as in (1) and B( A) = 

A I-B 

, (4) and (4') yield 




A(A) = Q l {\)’{\I-B) + i?i 
and A(A) = (A I-B)- Q 2 (A) + R 2 

in which the remainders Ri and Ri are free of It can be shown, moreover, that 
Rt = A r (B) and R 2 = Al(B) 

Example 5: 


(5) 

(5') 


With A(X) = 


X 2 X - 1 
X + 3 X 2 + 2 


1 2 


we have X/ - 


X + 1 3 

3 X + 3 


7 it) 
12 17 


From Example 3, the remainders are = A R (B) — 


7 10 
12 17 


[3 X + 3 
and R 2 = A l (B) - 


7 8 

14 17 


7 8 

14 17 h 
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THE CHARACTERISTIC ROOTS AND VECTORS OF A MATRIX 

We return now to further study of a given linear transformation of V n (f) into itself. 
Consider, for example, the transformation of V = V a (R) given by 

2 2 ll e, - (2,2,1) 

A = 13 1 or * 2 - (1,3,1) 

1 2 2J -> (1,2,2) 

(It is necessary to remind the reader that in our notation the images of the unit vectors 
*i, e 2 > e 3 of the space are the row vectors of A and that the linear transformation is given by 
V -* V : £-»£A 

since he may find elsewhere the image vectors written as the column vectors of A. In 
this case, the transformation is given by 

V^V: £-A£ 

For the same matrix A, the two transformations are generally different.) 

The image of £ = (1,2,3) € V is 

2 2 1 

V = (1,2,3) 13 1 = (7,14,9) € V 

L 1 2 2 _ 

whose only connection with £ is through the transformation A. On the other hand, the 
image of £ x = (r, 2r, r) G V is 5£,, that is, the image of any vector of the subspace V 1 c V, 
spanned by (1,2,1), is a vector of V 1 . Similarly, it is easily verified that the image of any 
vector of the subspace V s C V, spanned by (1,-1,0), is a vector of V 2 ; and the image of 
any vector of V 3 C V, spanned by (1,0,-1), is a vector of V 3 . Moreover, the image of 
any vector (s + 1, -a, -t) of the subspace V 4 c V, spanned by (1, -1,0) and (1,0, -1), is 
a vector of the subspace generated by itself. We leave for the reader to show that the 
same is not true for either the subspace F s , spanned by (1,2,1) and (1,-1 0) or of V 6 
spanned by (1,2,1) and (1,0, -1). 

We summarize: The linear transformation A of F 3 (R) carries any vector of the sub- 
spa^e V 1 , spanned by (1,2,1), into a vector of V 1 and any vector of the subspace V 4 , spanned 
by (1, -1,0) and (1,0, -1), into a vector of the subspace generated by itself. We shall call 
any non-zero vector of V\ also of V 4 , a characteristic vector (invariant vector or eigenvector) 
of the transformation. 


In general, let a linear transformation of V = V„ (f) relative to the basis « £ 
be given by the n-square matrix A = [o«] over f. Any given non-zero vector 
(*i» X2 < , x n ) G V is a characteristic vector of A provided £4 = A£, i.e., 

(anXl + 021*2 +-1- a,l*„, 012*1 + 022*2 +-1- On 2 *n, . . . , 


for some A E f. 


Ol„*i + 0 2 n*2 + • • • + O n „*n) 


(A*i, A*2, . . ., A* n ) 


I = 
(*) 


We shall now use (6) to solve the following problem: Given A, find all non-zero 
vectors £ such that £A = A£ with A6f. After equating corresponding components in 
(0), the resulting system of equations may be written as follows 


(A - On)*! 

- 

021*2 

- a 3 i*3 - 

• • • — a»i* n 

= 0 

-ai2*i 

+ 

(A — 02s)*2 

— 032*3 — 

• • • - 0»2*n 

= 0 

-Ois*i 

- 

023*2 

+ (A — 033)*3 — 

' • • — On3*n 

= 0 

—ain*i 

- 

&2nX2 

- a 3 „*s - 

“h (X dnn)Xn 

= 0 


which by Theorem XVIII, Chapter 14, Page 181, has a non-trivial solution if and only if 
the determinant of the coefficient matrix 
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A. — an —a2i —asi 


—Bin 1 


—ai 2 A — 022 —032 
—Ol3 —O 23 A — 033 


|A/ - A T | = 0 


—Ol„ —fl2n —03n ... A — flnn 


where A T is the transpose of A. Now A I-A T = (A /-A) T (check this); hence, by Theorem 
XXII, Chapter 14, |A/-A t | = |A/-A|, the determinant of the characteristic matrix of A. 

For any n-square matrix A over |A/ — A T \ is called the characteristic determinant 
of A and its expansion, a polynomial <f>( A) of degree n, is called the characteristic polynomial 
of A. The n zeros A t , A 2 , A 3 , . . .,A n of <t>( A) are called the characteristic roots (latent roots 
or eigenvalues) of A. 

Now <j>(k) G f [A] and may or may not have all of its zeros in f. (For example, the 
characteristic polynomial of a 2-square matrix over R will have either both or neither of its 
zeros in R; that of a 3-square matrix over R will have either one or three zeros in R. One 
may then restrict attention solely to the subspaces of V 3 (R) associated with the real zeros, if 
any, or one may enlarge the space to Vz ( C) and find the subspaces associated with all of the 
zeros.) For any characteristic root A f , the matrix kJ — A T is singular so that the system of 
linear equations (7) is linearly dependent and a characteristic vector | always exists. Also, 

is a characteristic vector associated with A { for every scalar k. Moreover, by Theorem 
XVIII, Chapter 14, Page 181, when A/-A T has rank r, then (7) has n-r linearly inde¬ 
pendent solutions which span a subspace of dimension n — r. Every non-zero vector of 
this subspace is a characteristic vector of A associated with the characteristic root k ( . 


Example 6: 


Determine the characteristic roots and associated characteristic vectors of V 3 (R), 


given A - 


112 
0 2 2 
-113 


The characteristic polynomial of A is 

I X — 1 0 1 I 

|X/ — A T | = —1 X — 2 -1 = X3 - 6X2 + 11X - 6; 

I -2 -2 X - 3 I 

the characteristic roots are X t = 1, X 2 = 2, X 3 = 3; the system of linear equations 

(7) 18 f(V-l)*i + *3 =0 

(a) -1 -*! + (X - 2 ) x 2 - *3 =0 

[ 2% i - 2*2 + (X-3)z 3 = 0 


When X = Xj = 1, the system (a) reduces to 


J x t + x 2 = 0 

1 *3 = 0 ’ 


having = 1, 


x 2 = —1, *3 = 0 as a solution. Thus, associated with the characteristic root X t = 1 
is the one-dimensional vector space spanned by = (1, —1,0). Every vector 
(k,—k, 0), k # 0, of this subspace is a characteristic vector of A. 

When X = X 2 = 2, system (a) reduces to j^ 1 + ** _ jj, having = 2, 

x 2 = —1, *3 — —2 as a solution. Thus, associated with the characteristic root 
X 2 = 2 is the one-dimensional vector space spanned by ( 2 = (2, —1, —2), and every 
vector (2k,—k,—2k), k # 0, is a characteristic vector of A. 

*1 + * 2 — 0 


When X = X 3 = 3, system (a) reduces to 


[ 2 * x + *3 = 0 


having *x = 1, 


x 2 — —1, * 3 = —2 as a solution. Thus, associated with the characteristic root 
X 3 = 3 is the one-dimensional vector space spanned by £ 3 = (1,-1,—2), and every 
vector (k,—k,—2k), k # 0, is a characteristic vector of A. 
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Example 7: 


Determine the characteristic roots and associated characteristic vectors of V 3 (R), 


given A 


2 2 1 
13 1 
12 2 


The characteristic polynomial is 
|\/-AT| = 


X — 2 -1 -1 

-2 X - 3 -2 

-1 -1 X - 2 


: X 3 - 7X 2 + 11X - 5 ; 


the characteristic roots are X t — 5, X 2 — 1, X 3 — 1; and the system of linear equa¬ 
tions (7) is 


(X — 2)x 1 - x 2 - 
-2x t + (X — 3)x 2 - 
-x t - 


+ (X-2)z 3 


having 


When X — Xj — 5, the system (a) reduces to 1 

l *i ~ 

*i *!-2 i *8 = 1 as a solution. Thus, associated with X x = 5 is the one¬ 
dimensional vector space spanned by = (1,2,1). When X = X 2 = 1, the system 
(o) reduces to x t + x 2 + *, = 0 having a;, = 1, x 2 = 0, x s = -1 and *! = 1, x 2 = -1, 
X3-O as linearly independent solutions. Thus, associated with X 2 = 1 is the two- 
dimensional vector space spanned by { 2 = (1,0, — 1) and { 3 = (1,—1,0). 


The matrix of Example 7 was considered at the beginning of this section. Examples 6 
and 7, also Problem 6, suggest that associated with each simple characteristic root is a 
one-dimensional vector space and associated with each characteristic root of multiplicity 
m > 1 is an ^-dimensional vector space. The first is true but (see Problem 7) the second 
is not. We shall not investigate this matter here (the interested reader may consult any 
book on matrices); we simply state 


If A is a characteristic root of multiplicity m^l of A, then associated with A 
is a vector space whose dimension is at least 1 and at most m. 


In Problem 8 we prove 

Theorem III. If A 1( A 2 , | 2 are distinct characteristic roots and associated characteristic 

vectors of an n-square matrix, then and | 2 are linearly independent. 

We leave for the reader to prove 

Theorem IV. The diagonal matrix D = diag^, A 2> .. .,AJ has A P A 2 .A n as charac¬ 

teristic roots and « lf « 2 , •••,«„ as respective associated characteristic vectors. 


SIMILAR MATRICES 

Two n-square matrices A and B over f are called similar over f provided there exists 
a non-singular matrix P over f such that B- PAP 1 . 

In Problems 9 and 10, Page 213, we prove 
Theorem V. Two similar matrices have the same characteristic roots, 
and 

Theorem VI. If is a characteristic vector associated with the characteristic root A. of 
B - PAP ', then £. = £.P is a characteristic vector associated with the 
same characteristic root \ of A. 
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Let A, an n-square matrix over f having A.,, X 2 .A n as characteristic roots, be 

similar to D = diag(A 1( a 2 , .. ,,A n ) and let P be a non-singular matrix such that PAP~ l = D. 
By Theorem IV, e. is a characteristic vector associated with the characteristic root A 4 of D 
and, by Theorem VI, = eP is a characteristic vector associated with the same charac¬ 
teristic root Aj of A. Now tJP is the ith row vector of P; hence, A has n linearly independent 
characteristic vectors eP which constitute a basis of V„(?'). 

Conversely, suppose that the set S of all characteristic vectors of an n-square matrix A 

spans V n {f'). Then we can select a subset {^, | 2 .£„} of S which is a basis of V n (f). 

Since each is a characteristic vector, 


where A x , A 2 , ..., A n a: 

“a, 0 ... O' 

PA = 0 A 2 ... 0 

0 0 ... A n 


^A = Ajtj, ZA = A 2 { 2 . SA = A,{, 

the characteristic roots of A. With 


PAP 1 = diag (A x , A 2 , ..., A„) = D 


and A is similar to D. We have proved 


Theorem VII. An n-square matrix A over f, having A x , A 2 , ..., A n as characteristic roots 
is similar to D = diag (A x ,A 2 ,..., A J if and only if the set S of all charac¬ 
teristic vectors of A spans V n (f)- 



2 2 1 


"*'1 


1 2 1 

Example 8 : For the matrix A = 

1 3 1 

12 2 

of Example 7, take P = 

?2 


10-1 

1-10 


Then P~ l 


'i i i 
i i -i 
i -I i_ 


and 



1 2 1 

2 2 1 

i i i 


5 0 0 

PAP~ l = 

1 0 -1 ' 

1-10 

.13 1 

12 2 

‘ i i 
i -i i 


0 10 

0 0 1 


Not every n-square matrix is similar to a diagonal matrix. In Problem 7, Page 213, 
for instance, the condition in Theorem VII is not met since there the set of all characteristic 
vectors spans only a two-dimensional subspace of Va(P). 


REAL SYMMETRIC MATRICES 

An n-square matrix A - [ay] over R is called symmetric provided A T = A, i.e., 
Chj = an for all i and j. The matrix A of Problem 6, Page 212, is symmetric; the matrices 
of Examples 6 and 7 are not. 

In Problem 11, Page 214, we prove 

Theorem VIII. The characteristic roots of a real symmetric matrix are real. 

In Problem 12, Page 214, we prove 

Theorem IX. If Aj,^; A 2 , | 2 are distinct characteristic roots and associated charac¬ 
teristic vectors of an n-square real symmetric matrix, then £ x and | 2 are 
mutually orthogonal. 
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Although the proof will not be given here, it can be shown that every real symmetric 
matrix A is similar to a diagonal matrix whose diagonal elements are the characteristic 
roots of A. Then A has n real characteristic roots and n real, mutually orthogonal 
associated characteristic vectors, say, 

A,, I,; A 2 ,| 2 ; A n ,| n 

If, now, we define v . = | 4 /|| t |, (i = 1,2, .. .,n), 

A has n real characteristic roots and n real, mutually orthogonal associated characteristic 
unit vectors 

Vp A 2 , t? 2 ; . . .; A n , Vn 

~Vi 

V 2 

Finally, with S = • , we have SAS 1 = diag(A 1( a 2 , .. ,,A n ). 


The vectors tj 2 , .. .,r) n constitute a basis of F„(72). Such bases, consisting of mutu¬ 
ally orthogonal unit vectors, are called normal orthogonal or orthonormal bases. 

ORTHOGONAL MATRICES 

The matrix S, defined in the preceding section, is called an orthogonal matrix. We 
develop below a number of its unique properties. 

1. Since the row vectors Vi of S are mutually orthogonal unit vectors, i.e., = 

[ 1, when i = j 

i . . . ., it follows readily that 

[ 0, when i j 


~v t ~ 

v 2 

• fo. V 2 > • • • * V n ] = 

r Vi ‘Vt Vi‘V 2 ■ 
V 2 -V 1 V 2 ‘V 2 • 

■ ■ Vi-V„ 

■ ■ Vi’Vn 

Vn 


_Vn-Vt Vn-V 2 ■■ 

■ ■ Vn * Vn 


and S T = S~K 

2. Since S • S T = S T • S = 7, the column vectors of S are also mutually orthogonal unit 
vectors. Thus, 

A real matrix H is orthogonal provided H-H T = H T -H - I. 

3. Consider the orthogonal transformation Y = XH of V„ (72), whose matrix H is orthog¬ 
onal, and denote by Y u Y 2 respectively the images of arbitrary X U X 2 G V n (72). Since 

Y r Y 2 = Y t Yl = (X x H)(X 2 HY = X^H-H^Xl = X 2 X£ = X r X 2 , 
an orthogonal transformation preserves inner or dot products of vectors. 

4. Since |Yi| = (Fi • Yi) 1/2 = (Xi • Xi) 1/2 - |Zi|, an orthogonal transformation preserves 
length of vectors. 

5. Since cos 6' = = cos 0, where 0 ^ 0,0' < tt, we have 0' - 0. 

|Yi|*|Y 2 | |Ai|*|X 2 | 

In particular, if Xi’X 2 — 0 then Yi • Y 2 = 0, that is, under an orthogonal transfor¬ 
mation the image vectors of mutually orthogonal vectors are mutually orthogonal. 
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An orthogonal transformation Y = XH (also, the orthogonal matrix H) is called proper 
or improper according as |ff| = 1 or |£T| = — 1 . 

Example 9: 

For the matrix A of Problem 6, we obtain 


,, = {,/y = (2/V6, -1/^6, —1A/6), 1)2 = (1/V3, 1/V3,1A/3). ns = (0, 1/^2, -1/^2) 

Then, with 



Vl 


2/V6 

-l/y/6 -1/V6 



2/Ve 

W« 

0 

s = 

V2 


1/y/S 

1/V3 Vy/Z 

, s-‘ 

= S T — 

- 1 /V 6 

1 /V 3 

1 /V 2 


L’la 


0 

1/V2 -1/V2 



- 1 /V 6 

1 /V 2 

- 1 /V 2 


and we have S*A-S _1 = diag(9,3, —3). 


The matrix S of Example 9 is improper, i.e., |S| = -1. It can be verified easily that 
had the negative of any one of the vectors y 2 , y 3 been used in forming S, the matrix 
then would have been proper. Thus, for any real symmetric matrix A, a proper orthogonal 
matrix S can always be found such that S* A • S _1 is a diagonal matrix whose diagonal 
elements are the characteristic roots of A. 

CONICS AND QUADRIC SURFACES 

One of the problems of analytic geometry of the plane and of ordinary space is the 
reduction of the equations of conics and quadric surfaces to standard forms which make 
apparent the nature of these curves and surfaces. 

Relative to rectangular coordinate axes OX and OY, let the equation of a conic be 
given as ax 2 + 5^2 + 2 cxy + 2 dx + 2 ey + / = 0 (8) 

and, relative to rectangular coordinate axes OX,OY and OZ, let the equation of a quadric 
surface be given as 

ax 2 + by 2 + cz 2 + 2 dxy + 2 exz + 2 fyz + 2 gx + 2 hy + 2kz + m = 0 (9) 

It will be recalled that the necessary reductions are effected by a rotation of the axes to 
remove all cross-product terms and a translation of the axes to remove, whenever possible, 
terms of degree less than two. It will be our purpose here to outline a standard procedure 
for handling both conics and quadric surfaces. 

Consider the general conic of equation (8). Its terms of degree two, ax 2 + by 2 + 2cxy, 
may be written in matrix notation as 

ax 2 + by 2 + 2cxy = ( x,y ) • = X • E • X T 

where X - (x, y). Now E is real and symmetric; hence there exists a proper orthogonal 
matrix 5 = such that S-E-S" 1 = diag(X,,A 2 ) where A,,*; \ r v 2 are the charac¬ 

teristic roots and associated characteristic unit vectors of E. Thus there exists a proper 
orthogonal transformation X = (x', y')S = X'S such that 

X'S • E • S~ l X' T = X'[ A 0 ' ®]x" = V + M-* 
in which the cross-product term has 0 as coefficient. 
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Let S = 
(x,V) 

and we have 


S = P'l = h H then 
[>J |_^21 

= X = X'S = y = lv u x' + V21 y', Vn x' + n 22 y'} 


x = vu x ' + v 21 y' 

y = VuX' + VvV' 


This transformation reduces (8) to 

K x ’ 2 + \V' 2 + 2(d Vn + e Vl2 )x' + 2(d V21 + e v Jy' + f = 0 (S') 

which is then to be reduced to standard form by a translation. 

An alternate procedure for obtaining (S') is as follows: 

(i) Obtain the proper orthogonal matrix S. 

(ii) Form the associate of (S) 

a c d fx\ 

ax 2 + by 2 + 2cxy + 2dxu + 2eyu + fu 2 = ( x,y,u )• c b e - \y = X-F-X T = 0 
where X = (x, y, u). e W 

(iii) Use the transformation X = X' Jj , where X' = (x\ y’, u'), to obtain 


S 0 
0 1 


S T 0 
0 1 


X' T = 0 


the associate of (5'). 

Example 10: Identify the conic 5*2 - 2^3 xy + 7 y 2 + 20^3 x - 44 y + 75 = 0. 

For the matrix E - of terms of degree two, we find 4,(£V3,£); 8, (-£, ^\/3) 

as the characteristic roots and associated characteristic unit vectors and form S = j ^ 1 

L~* ivaj • 


(~s ol 

5 -Vs 10 Vs 

reduces X'F-XT = X 

—V3 7 -22 

L° X J 

10V3 -22 75 



tV3 | 0 

5 -V3 

10\/3 

*V3 -* 0 

X' 

iV3 0 

~Vz 7 

-22 

{ #0 


0 0 1 

10\/3 -22 

75 

0 0 1 


- 32\/3 j/V + 75m' 2 


= 0 8 -16^/3 I • | y' | = 4*' 2 

[_4 -16V3 76 

the associate of 4*' 2 + 8 y’ 2 + 8*' - 32 y/iy' + 75 = 4(*' + l) 2 + Z(y'-2y[Z) 2 - 
’ = *' + 1 
’ - V'- 2y/Z 


Under the translation 






it follows readily that, in terms of 


l v — i x> + iV3 y’ [ y' = y" + 2yfz 

the original coordinate system, the new origin is at 0"(-3V r 3/2, 5/2) and the new axes 0”X" and 
0"Y" have respectively the directions of the characteristic unit vectors (£\/3, 1) and (-£, ^3). 

See Problem 14. 
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Solved Problems 

A 2A+1 X + 2 

1. Reduce A(A) = A 2 + A 2A 2 + 2A A 2 + 2A to normal form. 

A 2 —2A 2A 2 — 2A. — 1 A 2 + A-3 

The greatest common divisor of the elements of A(X) is 1; set /j (\) = 1. Now use K Z1 (—2) 
followed by K 12 and then proceed to clear the first row and first column to obtain 


0 X 2 + X X* + 2X ~ 
2X — 1 X 2 — 2X X 2 + X — 3 


0 X 2 + X X 2 + 2X 
0 -X 2 -X —X 2 — 2X — 1 


0 X 2 + X X 2 + 2X 
2X -1 -X 2 - X -X 2 - 2X - : 


X 2 + X X 2 + 2X ... , 

The greatest common divisor of the elements of the submatrix _^ 2 _^ —> 2 — 2\ — 1 I W 861 

/ 2 (X) = 1. On B(\) use ff 23 ( 1) and X 23 (-l) and then proceed to clear the second row and second 
column to obtain 


|^0 X +1 -X 2 —2X-1J |^0 X +1 - 

10 0 10 0 

~ 0 1 0 ~ 0 1 0 
0 0 —X 2 — X 0 0 X 2 +> 

the final step being necessary in order that / 8 (X) = X 2 + X be monic. 


["* 0 ] and (6) B( A) = 

A® 0 0 

0 A 2 -A 0 

L0 A + lj 

0 0 A 2 


2. Reduce (a) A(A) = * ” \ and (6) B(X) = 0 A 2 -A 0 : 

i° A+I J L« o 

(a) The greatest common divisor of the elements of A (X) is 1. We obtain 

- Ti ,!.l - \\ - Lrix 


1 ° 1 

0 X 2 + X 


(6) The greatest common divisor of B(\) is X. We obtain 


0 X 2 -X 0 I ~ I 0 X 2 — X 0 


X X 2 - X 0 

~ —X 3 0 0 

0 0 X 2 


X 2 X 2 - X 0 
-X 3 + X 2 X 2 — X 0 
0 OX 2 


~ -x® + 

0 


X 0 0 

OX 2 0 

0 0 X 4 — X s 
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3. Reduce A (A) = 


A — 2 -1 -1 

-2 A - 3 -2 

-1 -1 A — 2 


to normal form. 


The greatest common divisor of the elements of A(\) is 
then proceed to clear the first row and column to obtain 


1. We use K 13 followed by Kj(—1) and 


A(\) - 


-1 X- 
x — 3 


0 X — 1 2 — 2X 

0 1-X X 2 — 4X + 3 


The greatest common divisor of the elements of B(\) is X -1; then 
10 0 ~| |~1 
AM ~ | 0 X — 1 2 — 2X I ~ I 0 X — 1 

1-X X 2 - 4X + 3 


4. Write A (A) = 


A + 2 A + 1 AH-3 
A A -3A 2 + A 
A 2 + 2A A 2 + A 3A 2 + 5A 


i polynomial in A and compute A(—2), 


A r (C) and A L (C) when C = 


1 0 2 
-1 -1 -4 
-1 0 -2 


1 1 1 
1 1 1 
2 15 


2 13 
0 0 0 
0 0 0 


0 0 ol 

-1 0 -2 

0 0-3 

4 1 10 


10 2 

-1 0 -2 

To 0 0 

4 1 10 

0 0-3 

10 2 

IL 1 1 3 _ 


Mi 


2 13 
0 0 0 
0 0 0 


-1 -4 
0 -2 


1 1 1 
1 1 1 
2 15 


2 13 
0 0 0 
0 0 0 

2 13 
0 0 0 
0 0 0 


Given A( A) = ^ 4 + ^ + 3A 2 H-A A< +A* + 2A 2 +A +ll Ta 2 + 1 A 2 -a] 

L A 3 - 2 A + 1 2A s — 3A 2 — 2 J and ~ La 2 + A 2A 2 +iJ’ 
find Q 1 (A),/e 1 (A);Q 2 (A),/e 2 (A) such that 


(a) A(a) = Qi(a) • B(x) + Ri(\) 

We have r 

A(X) = \l l X 4 + 


and ( 6 ) A(a) = B{ A) • Q 2 (A) + R z ( A). 
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o *»> - = [; *1 x. + r* jlx* + f_‘;]» + [; 41 


C(X) - C 3 B 2 - 1 -X-B(X) = [-i-3j xi 

D(X) - D 2 B“ l *B(X) = 0 = Bj(x) 

an d r- 

Qi(\) = (A 4 x 2 + C 3 x + D 2 )B~ 1 = 

Here, B(X) is a right divisor of A(X). 

(6) A(X) - B(X)«B 2 -*A 4 X* = f ® ®1 X* 


B(X) - B(X)-B-‘B 3 X 


F(X) - B(X)*B 2 _1 F 2 


0 -X - 1 
-X + 1 —2X 


Q 2 (X) = B” 1 (A 4 X 2 + B 8 X + F 2 ) = 


6. Find the characteristic roots and associated characteristic vectors of A - -2 


-2 4 lj 


The characteristic polynomial of A is |X/ —A T | 


= X s - 9X 2 - 9X + 81; 


the characteristic roots are X, = 9, X 2 = 3, X 3 = -3; and the system of linear equations (7) i 


(X — 7)*i + 2* z 

2xj + (X - 1)* 2 
2*j — 4*2 


When X = Xj = 9, (a) reduces to + 2* 3 = 0 navl,,B ~ 2 ~ ~ 3 * 

Thus, associated with X x = 9 is the one-dimensional vector space spanned by £i = (2, -1, -1). 


having x t = 2, * 2 = —1, * 3 = -1 as a solution. 


When x = X 2 = 3, (a) reduces to 


having x t = 1, — 1, x a — 1 as a solution. 


Thus, associated with X 2 = 3 is the one-dimensional vector space spanned by £ 2 - (1. *)• 


When X = X a = -3, (a) reduces to ^ + ^ _ 0 **»**"« -z -.-a * ““ 

Thus, associated with X 3 = -3 is the one-dimensional vector space spanned by £ 3 = (0,1, -1). 


having *i = 0, ** — 1, x 3 - 
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7. Find the characteristic roots and associated characteristic vectors of A — 
over R. 

lx l l| 

The characteristic polynomial of A is |A/- AT| = 2 X-l 1 = X»-3\2 + 4; the 

I 2 -2 X — 2 I 

characteristic roots are X, = —1, X 2 = 2, X 3 = 2; and the system of linear equations (7) is 

{ X*! + + * 3 =0 

2*i + (X — l)x 2 + x 3 =0 

2*i - 2*2 + (X — 2)* 3 = 0 

When X = X, = -1, the system (a) reduces to J *i ~ *2 = 0 havjn& ^ ^ x ^ _ 0 

[ *3 — 0 

j S _ a ^ S ° 1 1U ^ 0n ' Tlms ’ associated with Xj = —1 is the one-dimensional vector space spanned by 

When X = X 2 = 2, the system (a) reduces to J3 *i + * 8 = 0 havjng . . *, = -3 

[ *> * 2-0 

I S ^^ US ’ associated with X 2 = 2 is the one-dimensional vector space spanned by 

Note that here a vector space of dimension one is associated with the double root X 2 = 2 whereas 
m Example 7 a vector space of dimension two was associated with the double root. 


0 -2 -2 
-112 
-1 -1 2 


8. Prove: If A,, A 2 , £ 2 are distinct characteristic roots and associated characteristic 

vectors of A, then and | 2 are linearly independent. 

Suppose, on the contrary, that {, and { 2 are linearly dependent; then there exist scalars o, and o, 
not both 0, such that 

(i) «iii + «2{2 = 0 

Multiplying (i) on the right by A and using fjA = Xfo, we have 

(ii) «i£iA + a 2 t 2 A = + a 2 X 2 £ 2 = 0 

Now (i) and (ii) hold if and only if * = 0. But then X 1 = X 2 , a contradiction; hence, and f 2 

are linearly independent. I 1 2 I 2 


9. Prove: Two similar matrices have the same characteristic roots. 

Let A and B = PAP -1 be the similar matrices; then 

\I-B = \I-PAP-1 = P\IP~i-PAP~ i = P(x/-A)P-i 
and |X/~H| = |P(A/-A)P-i| = |P|-|X/-A|.|P-1| = |x/-A| 

Now A and B, having the same characteristic polynomial, must have the same characteristic roots. 


10. Prove: If is a characteristic vector associated with the characteristic root A. of 
B = PAP 1 , then £, = is a characteristic vector associated with the same charac¬ 
teristic root A, of A. 

By hypothesis, = A& and BP = (PAP-i)P = PA. Then f ( A = f,PA = fc£P = \ ifi p = Xi f- 
and ^ is a characteristic vector associated with the characteristic root \ { of A. ' 
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11. Prove: The characteristic roots of a real symmetric n-square matrix are real. 

Let A be any real symmetric matrix and suppose ft + tfc is a complex characteristic root. Now 
(ft + ik)I - A is singular as also is 

B = [(ft + tfc)/ - A] • [(ft-tfc)/-A] = (ft2 + fc 2 )/-2ftA + A* = (. hl-AP + m 
Since B is real and singular, there exists a real non-zero vector £ such that £B = 0 and, hence, 

£Bf = £{(ft/ - Ap + - {£(ft/-A)}{(ft/-A) T £ T } + fc 2 ££ T 

= V.V + fc 2 £ * { = 0 

where v = i(hl — A). Now v • v — 0 while, since £ is real and non-zero, £ • £ > 0. Hence, k- 0 
and A has only real characteristic roots. 


12. Prove: If A. A 2 , | 2 are distinct characteristic roots and associated characteristic 
vectors of an n-square real symmetric matrix A, then and | 2 are mutually orthogonal. 
By hypothesis, £jA = Xj£i and £ 2 A = X 2 £ 2 . Then 

£iA i 2 = and t*A = *2*2 

and, taking transposes, £ 2 A (f = Xj £ 2 (J and £iA £ 2 = X 2 £j £ 2 

Now £ x A & = X 1 £ 1 £j = X 2 £j £ 2 and (X! - X 2 )£j£j = 0. Since X, - X 2 * 0, it follows that £ x £ 2 = 
£, • £ 2 = 0 and £j and £ 2 are mutually orthogonal. 


13. (a) Show that a = (2,1,3) and p = (1,1,-1) are mutually orthogonal. 
( 6 ) Find a vector y which is orthogonal to each. 

(c) Use a,p, y to form an orthogonal matrix S such that |S| = 1. 


(a) a • p = 0; a and (3 are mutually orthogonal. 

(I b) y = aX/3 = (-4,5,1). 

(c) Take p j = a/|a| = (2/\/l4, l/VIi, 3/VH), Pi = P/\P\ = (1/V3, 1/y/S, —1/\/3) and p 3 = yl IyI = 
(—4/\Zi2, 5/x/42, 1/V42). Then 


Pi 


Pi 


2/Vl4 l/VIi 3/V14 


= 1 and S = 

Pi 

= 

1/y/z 1/V3 -1/^3 

P3 


P3 


-4/\/42 5/V42 1/-/42 


14. Identify the quadric surface 

3z 2 - 2 y 2 -z 2 - 4 xy - 8 xz - 12 yz - 8 x - 16 y - 34 z - 31 


For the matrix 



of terms of degree two, take 


0 


3, (2/3,-2/3,1/3); 6, (2/3,1/3,-2/3); -9, (1/3,2/3,2/3) 
s characteristic roots and associated characteristic unit vectors. Then, with S = 


X = (*, y, z, u) = (x',y',z',u') 


r s o" 

L° 


2/3 -2/3 1/3 

2/3 1/3 -2/3 

1/3 2/3 2/3 


S 0 
0 1 
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3 -2 -4 -4 


reduces 




X-F 

.XT 

= 

X 

-2 -2 

-4 -6 

-6 

-1 


j XT 












-4 -8 

-17 

-31J 






2/3 

-2/3 

1/3 

0 


3 

_ 

2 -4 -4 

I 

- 

2/3 

2/3 

1/3 

0 

t° 

X' 

2/3 

1/3 - 

-2/3 

0 


-2 

- 

2 -6 -8 



-2/3 

1/3 

2/3 

0 



1/3 

2/3 

2/3 

0 


-4 

- 

6 -1 -17 



1/3 - 

-2/3 

2/3 

0 



_ 0 

0 

0 

1_ 


_—4 


8 -17 -31 


_ 

0 

0 

0 

1_ 







~ 

3 

0 

0 —3~ 

A 

A 








- 

X 

, , , 



0 

6 

0 6 | 

V 

, i 









,V ,z. 

u) 


0 

0 

-9 -18 * 

* 

' 












-3 

6 

-18 -31 

V 

7 






= 3x'2 + 6j/'2 - 9z'2 - 6*v + 12 y'u' - 36zV - 31m'* = 0 


the associate of 


3*'2 + 6j/'« - 9z'2 - 6x' + 12j - 36 g' - 31 = 3(x' - 1)2 + 6(j/' 4-1)* - 9(*' + 2) 2 - 4 = 0 


{*" = *’~1 

Under the translation <y" = y' + 1 , this becomes 3x" 2 + 6j/" 2 - 9z" 2 = 4; the surface is a 
[*" = z' +2 

hyperboloid of one sheet. 

Using (x,y,z) = (x',y',z')S and the equations of the translation, it follows readily that, in 
terms of the original coordinate system, the new origin has coordinates (—2/3,-7/3,—1/3) and the 
new axes have the directions of the characteristic unit vectors (2/3, —2/3,1/3), (2/3,1/3, —2/3) 
(1/3,2/3,2/3). 


Supplementary Problems 


MX) = \ X2+X X + 1 1 and B(X) = [ X2 X2+X ], find 

[_\2 + 2 X J |_X-1 X J 

X X2 + 2X + 1 1 («) A(X)-R(X) = |" x4 + x8 + x2 -l 

-1 2X J |_ X 4 +3X 3 -X 

A(X)-R(X) = X ~^ + l] (d) B(X) . A(X) = P 

|_X 2 — X + 3 0 J J 


(«) MX) + B(x) = 


X* + 2X3 + 2X2 + X 
X* + X» + 3X2 + 2x 


2X 4 + 2X 3 + 2X2 + 2x 2X 3 + 2X 2 
2X 3 + X 2X2-1 


16. In each of the following find Q t (X), fi,(X); Q 2 (X), R 2 (X), where R^X) and R 2 (x) are either 0 or of degree 
less than that of R(X), such that A(X) = Q x (X) • B(X) + R t (X) and A(X) = B(X) • Q 2 (x) + R 2 (X). 


«.> BM ... r» ! +i x i 

X 4 + X 3 + X — 2 2X Z + X — 1J [ 1 X2 + X 




216 


MATRIX POLYNOMIALS 


[CHAP. 15 


’ [x* + X + 2 X 2 + 2X - 


f" 2X 2 + 2X 2X 2 1 . 

-TX-iJ' 

[x 3 - 2X 2 + X - 3 4X 2 + 2X + 3 "1 . 

[ —2X — 2 X 3 + 4X 2 + 6X + 3J ’ 


X X 
1 X 


~X — 2 
-1 



X s — X 2 + X + 4 

2X 2 + X 

X 2 + 4X — 2 


X — 1 1 0 

(d) A(X) = 

2 x 

3X 2 — 4X — 1 

X s + X 2 + 2X - 1 

X 2 + X 

4X 2 - 2X + 1 

X s — 2X 2 + 2X + 2 

; B(\) = 

-1 X +1 3 

2 0 X — 2 


An ‘- <•> = jV+x-i -i+«] ! B,(xl ■ [?-« 4 -i‘] 

«» = [;. 2 x ’ r ‘] ; f »] 

(*> «■<» = [“ 1 1 1} *‘ (X,= [l-°] 

« x >=[T ;;;]• 

<•> «- w = [x« B ‘ w = o 

«■<>•> = [x’r 6 2 x’+l + J'’ R,M = [n -»] 


X 2 X X + 3 I 


-2 0 4 

X + 1 X 2 + 1 X 

; *i(x) = 

2-3-2 

X - 2 X - 1 X 2 - 1 


1-2 3 3 


Q 2 W = 

X 2 X + 2 X + 4 

X - 2 X 2 X - 2 

; « 2 (x) = 

6 2 4 

8-2 7 


X — 2 X + 1 X 2 


-5 -2 -6 


17. Reduce each of the following to its normal form: 



X 2X 

2X — 1 


X 2 

0 

0 

(a) 

X 2 + 2X 2X 2 + 3X 
X 2 - 2X 3X 2 - 4X 

2X 2 + X — 1 

4X 2 - 5X + 2 

(d) 

0 

0 

X + 1 

0 

0 

x + l 


X 2 + 1 X 3 + X X 3 - X 2 
(6) X — 1 X 2 +1 -2X 

X 2 X 3 X 3 - X 2 +1 




-X x + l 

X + 2 


X — 2 

2 

3 

(«) 

-X 2 X 2 + X - 1 

X 2 + X + 1 -X 2 - 2X - 1 

X 2 + 2X - 1 
—X 2 — 3X — 2 

(f) 

-3 

2 

X + 3 

0 

4 

X + l 
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Am. (a) 

10 0 

0X0 

(c) 

10 0 

0 1 0 

(e) 

1 0 0 

0 1 0 


0 0 X 2 


0 0 1 


0 0 X 3 + 2X 2 + 11X + 20 


1 0 0 100 10 0 
(6) 0 X +1 0 (d) 0 X + 1 0 (/) 0 1 0 

0 0 X 3 +1 0 0 X s + X 2 0 0 X s + 2X 2 - 5X - 2 


Find the characteristic roots and associated characteristic vectors of each of the following matrices 
A over R. 



Am. (a) 6,(k,-k); -1, (6k,21c) (e ) l,(k,-k,-k); 2, (2k, -k,0); 3 ,(k,-k,k) 

(b) 0,(4k.k); 6,(2 k,-k) (f) -1, (k,2l,-k-t); 8, (2k, k, 2k) 

2 - (k > k ~> ( 9) 1, (3k, 2k, k); 2,(k,3k,k); -l,(k,0,k) 

(d) 0, (2k, k); 5, (k,-2k) 
where k¥- 0 and l ¥= 0. 


19. For an n-square matrix A, show 

(a) the constant term of its characteristic polynomial is (—1)" |A|, 

(ft) the product of its characteristic roots is |A|, 

(c) one or more of its characteristic roots is 0 if and only if |A| = 0. 


20 !?I?- T A e characteristic PoJy" 01 ™* 1 of an w-square matrix A is the product of the invariant factors 
Hint. From P(X) • (X/ — A) • S(\) = diag(A(X),/ 2 (X),.. .,/ n (X)) obtain 
W,«. woi.Wffl.L 


21 . 


For each of the following real symmetric matrices A, find a proper orthogonal matrix S such that 
SAS _1 is diagonal. 
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- 2A/6 -l/y/e -I/a/61 


2/3 

-1/3 

2/3 

(a) 

0 1/V2 -1/a/2 

(h) 

1/V2 

0 

-1/V2 


l/\/3 i/a/3 1/a/3_ 


_l/3\/2 

4/3V2 

1/3V2_ 


22. Identify the following conics: 

(а) 4x 2 + 24xj/ + llj/ 2 + 16x + 42j/ + 15 = 0 

(б) 9x 2 - 12xy + 4y 2 + 8\/l3 x + 12>/l3 y + 52 = 0 

(c) 3x 2 + 2xy + 3y 2 + 4\/2x + 12\/2y -4 = 0 

Ans. (a) Hyperbola, (6) Parabola, (c) Ellipse 

23. Identify the following quadric surfaces: 

(a) 3x 2 + 8y 2 + 3z 2 - 4xy - 4xz - 4yz - 4x - 2y - 4z + 12 = 0 

(5) 2x 2 - y 2 - 6z 2 - lOxy + 6yz + 50x - 74y + 42z + 107 = 0 

(c) 4x 2 + y 2 + z 2 — 4xy — 4xz + 2yz — 6y + 6z + 2 = 0 

(d) 2xy + 2xz + 2yz + 1 = 0 

Ana. (a) Elliptic paraboloid, (6) Hyperboloid of two sheets, (c) Parabolic cylinder 

24. Let A have X 1( X 2 ,..., X n as characteristic roots and let S be such that 

S • A • S -1 = diag (Xj, X 2 .X n ) = D 

Show that SAT'S-' = D when S = S ~>. Thus any matrix A similar to a diagonal matrix is similar 
to its transpose A T . 

25. Prove: If Q is orthonormal, then Q T = Q _1 - 

26. Prove: Every real 2-square matrix A for which |A| < 0 is similar to a diagonal matrix. 

27. Prove by direct substitution that A = £ jj is a zero of its characteristic polynomial. 

28. Under what conditions will the real matrix A = 

(а) equal characteristic roots, 

(б) the characteristic roots ±1. 


have 
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LINEAR ALGEBRA 

A set £ having binary operations of addition and multiplication, together with scalar 
multiplication by elements of a field f, is called a linear algebra over f provided 

(i) Under addition and scalar multiplication, £ is a vector space £(f) over f. 

(ii) Multiplication is associative. 

(iii) Multiplication is both left and right distributive relative to addition. 

(iv) £ has a multiplicative identity (unity) element. 

(v) (ka)p = a(kp) = k( a • p) for all «,pe£ and kef. 

Example 1: (a) The field C of complex numbers is a linear algebra of dimension (order) 2 over 

the field R of real numbers since (see Chapter 13) C(R) is a vector space of 
dimension 2 and satisfies the postulates (ii)-(v). 

(b) In general, if ^ is any field having f as a subfield, then f(f) is a lirnar algebra 
over f. 

Example 2: Clearly, the algebra of all linear transformations of the vector space V n (f) is a 
linear algebra of order n 2 . Hence the isomorphic algebra M n (f) of all x-square 
matrices over f is also a linear algebra. * 


AN ISOMORPHISM 

The linear algebra of Example 2 plays a role here similar to that of the symmetric 
group Sn in group theory. In Chapter 9 it was shown that every abstract group of 
order n is isomorphic to a subgroup of S n . We shall now show that every linear algebra 
of order n over f is isomorphic to some subalgebra of M n (f). 

Let ^ be a linear algebra of order n over f having {x u x 2 ,x 3 , ....x n ) as basis. With 
each a G Jl, associate the mapping 

T a : X Ta - X’a, X £ 

By (iii), xT a + yT a = x-« + y-a = (x + y)a = (x + y)T a 

and by (v), (kx) T a = (kx) a = k(x •«) = k(x T a ) 

for any x,y G. £ and k E f. Hence Ta is a linear transformation of the vector space 
£{f). Moreover, the linear transformations T a and T e associated with the distinct elements 
a and p of Jl are distinct. For, if a ¥= p, then u • a ¥* u* p, where u is the unity of jC, 
implies T a ¥> Tp. 

Now, by (iii) and (v), 

xT a + xTp = X-a + X-p = x(a + p) = xT^p 
(xT a )Tp = (x-a)p = x(a ’ P) = xT a .p 
and (kx) T a = (kx)a = k(x-a) = x-k a = xT ka 
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Thus the mapping a -> T a is an isomorphism of onto a subalgebra of the algebra of all 
linear transformations of the vector space £(T)- Since this, in turn, is isomorphic to a 
subalgebra of M n {f), we have proved 

Theorem I. Every linear algebra of order n over f is-isomorphic to a subalgebra of M n (f). 

Example 3: Consider the linear algebra Q[v^2] of order 3 with basis {1, \/2, \/4}. For any 
element a = a x 1 + a 2 \Z2" + a 3 of Q[v^2], we have 

1 • a = a x 1 + a 2 yj 2 + a 3 yfl 
y/ 2 -a = 2a 3 1 + a t y /2 + a 2 yfi 
yfi • a = 2a 2 1 + 2o 3 V^2 + 


Then the mapping 

1 + a 2 y/2 + a 3 \[i 


a 2 a 3 

2o 3 ttj a 2 
2 a 2 2 a 3 a, 


is an isomorphism of the linear algebra Q[yj 2 ) onto the algebra of all matrices of 


M n (Q) of the form 


2f r 
2s 2 1 


See also Problem 1. 


Solved Problems 


1. Show that = {a x • 1 + a 2 « + a 3 p : a ( G R} with multiplication defined such that 
1 is the unity, 0 = 0-l + 0-a + 0-/? is the additive identity, and 


(a) 


• « P 

a a p 

poo 


are linear algebras over R. 


and (6) 


• 1 « ft 

a a 0 

poo 


We may simply verify in each case that the postulates (i)-(v) are satisfied. Instead, we prefer to 
show that in each case -C is isomorphic to a certain subalgebra of M 3 (R). 

(a) For any a = a x • 1 + a^a + a^P, we have 

1 • a — a x l + + o 3 j8 

a-a — («! + a 2 )oc + a 3 p 
P’ a - * X P 

Hence Ji is isomorphic to the algebra of all matrices M 3 (R) of the form 
and is a linear algebra over R. 

(b) For any a — o x • 1 + a 2 a + a 3 p, we have 

1 • a — ttjl + o<ja + a 3 p 
a’ a — (oj + ag)a 
P ' a — a x p 


a x a 2 a 3 
0 a x + a 2 a 3 
0 0 a x 


Hence £ is isomorphic to the algebra of all matrices M 3 (R) of the form 


a x 4- a 2 0 
0 a x 
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Supplementary Problems 

2. Verify that each of the following, with addition and multiplication defined as on R, is a linear algebra 
over Q. 

(а) Q[V3] = {al + by/ 3 : a,b G Q> 

(б) £ = {al + by /3 + ey/b + dy/ 15 : a,b,c,d G Q} 


Show that the linear algebra ^ — Q{y/t\, where t G N is not a perfect square, is isomorphic to the 
algebra of all matrices of M 2 (Q) of the form \ 1 . 


4. Show that the linear algebra C over R is isomorphic to the algebra of all matrices of M 2 (R) of the 


5. Show that each of the following is a linear algebra over R. Obtain the set of matrices isomorphic 
to each. 

(а) -C = {al + 6a + ca 2 : a, 6, c G R}, where G = {a, a 2 , a 3 = 1} is the cyclic group of order 3. 

(б) ^ = {djl + a 2 x + a :t y : a ; G R>, with multiplication - defined so that 1 is the unity, 0 = 
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BOOLEAN ALGEBRA 

A set <B, on which binary operations U and fl are defined, is called a Boolean algebra 
provided the following postulates hold: 

(i) U and n are commutative. 

(ii) <B contains an identity element 0 with respect to U and an identity element 1 
with respect to n. 

(iii) Each operation is distributive with respect to the other, i.e., for all a, 5, c G ‘B 

a U (6 n c) = (a U b) n (a U c) 
and a n (b U c) = (a n 6) U (a n c) 

(iv) For each a G 'B there exists an a' G 1 B such that 

a U o' = 1 and a n a' = 0 

The more familiar symbols + and • are frequently used here instead of U and n. We 
use the latter since if the empty set 0 be now denoted by 0 and the universal set U be now 
denoted by 1, it is clear that the identities 1.9-1.9', 1.4-1.4', 1.10-1.10', 1.7-1.7', proved valid 
in Chapter I for the algebra of all subsets of a given set, are precisely the postulates 
(i)-(iv) for a Boolean algebra. Our first task then will be to prove, without recourse to 
subsets of a given set, that the identities 1.1, 1.2-1.2', 1.5-1.5', 1.6-1.6', 1.8-1.8', 1.11-1.11' 
of Chapter 1 are also valid for any Boolean algebra, that is, that these identities are 
consequences of the postulates (i)-(iv). It will be noted that there is complete symmetry 
in the postulates with respect to the operations U and n and also in the identities of (iv). 
Hence there follows for any Boolean algebra the 

Principle of Duality. Any theorem deducible from the postulates (i)-(iv) of a 
Boolean algebra remains valid when the operation symbols U and n and the identity 
elements 0 and 1 are interchanged throughout. 

As a consequence of the duality principle it is necessary to prove only one of each pair of 
dual statements. 

Example 1: Prove: For every »£?, 

a U a — a and 

Using in turn (ii), (iv), (iii), (iv), (ii): 
a U a = (a u a) n 1 = (o U o) n (a U 

Example 2: Prove: For every nG?, 

o U 1 = 1 and o n 0 = 0 (2) 

(See 1.5-1.5', Chapter 1, Page 5.) 

Using in turn (ii), (iv), (iii), (ii), (iv): 

o n 0 = 0 u (a n 0) = (a n o') U (a n 0) = a n (o' u 0) = a n a' = 0 
222 


a n a = a (1) 

(See 1.6-1.6', Chapter 1, Page 5.) 

o') = o U (o n o') = a U 0 = a 
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Example 3: Prove: For every a,bG c B, 

a U (a n 6) = a and a n (a U 6) = a (j) 

Using in turn (ii), (iii), (2), (ii): 

« u (a n b) = (o n 1) u (a n 6) = a n (1 u b) - a n 1 = a 

See also Problems 1-4. 

BOOLEAN FUNCTIONS 

Let <B = {a,b,e, ...) be a Boolean algebra. By a constant we shall mean any symbol, 
as 0 and 1, which represents a specified element of <B; by a variable we shall mean a symbol 
which represents an arbitrary element of <B. If in the expression x'U (y n z) we replace 
U and n by ‘ to obtain *' + »**. it seems natural to call x' and y n 2 monomials 
and the entire expression x' U {y C\ z) a polynomial. 

Any expression as x U x', a n b', [o n (b U c')] U (a' n b’ n c) consisting of combina¬ 
tions by U and n of a finite number of elements of a Boolean algebra H will be called a 
Boolean function. The number of variables in any function is the number of distinct 
letters appearing without regard to whether the letter is primed or unprimed. Thus x u x' 
is a function of one variable x while a n b' is a function of two variables a and b. ’ 

In ordinary algebra every integral function of several variables can always be expressed 
as a polynomial (including 0) but cannot always be expressed as a product of linear factors 
In Boolean algebra, on the contrary, Boolean functions can generally be expressed in 
polynomial form (including 0 and 1), i.e. a union of distinct intersections, and in factored 
form, i.e. an intersection of distinct unions. 

Example 4: 

Simplify: 

(a) (xny)u [(* u V ') n y]', ( b ) [(x u y') n (x n y' n *)']', (c) {[(*' n y')' u z] n (x u *)}' 

(а) (*n»)u [(x u y') n y\ = (x n y) U [(x u y'Y u y'] = (x n y) U [(x' n y) u y'} 

= (*n»)u [(*' u y’) n (y u y')} = (*nif)u [(»' u 1 ,') n i] 

= (xny)u (*' u y') = (* n y) u (x n y)' = l 

(б) [(* u y’) n(xny'n *)']' = (* u y')' u ((x n/n *)']' 

= (x n y) U (x n y' n x), a union of intersections. 

[(x u y') n (x r\y' n *)']' = (x' n y) u (x n y' n ») 

= (x' u x) n (x' u y') n (x' u *) n (x u k) n (v u y') n (y u z) 

= ln(x'u y') n (x' u z) n (x u y) n 1 n (y u *) 

- (x u y) r\ (y u z) n (x' u z) n (x' u y'), an intersection of unions. 

(c) {[(x' n y'Y u*]n(xu z)}' = [(*' n y'Y u *]' u (x u z)' 

= (*' ni/’n z') u (*' n z') = x'n z' (by Example 3) 

See also Problem 5. 

i Sm ^ e ( see Prob,em 15 > Pa ^ e 234) there exists a Boolean algebra having only the 
elements 0 and 1, any identity may be verified by assigning in all possible ways the values 
0 and 1 to the variables. 

Example 5: To verify the proposed identity (see Example 4(a)) 

(x n y) U [(x U y') fl »]' = 1 
we form the following Table 17-1. 
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X 

V 

a = x n y 

x U y' 

b = (xuj')nt 

a u b' 

1 

1 

1 

0 

1 

0 

1 

1 

1 

0 

1 

1 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

0 

1 


Table 17-1 


NORMAL FORMS 

The Boolean function in three variables of Example 4(6) when expressed as a union 
of intersections {x’ n y) U (x n y' n z) contains one term in which only two of the vari¬ 
ables are present. In the next section we shall see that there is good reason at times to 
replace this expression by a less simple one in which each term present involves all of the 
variables. Since the variable z is missing in the first term of the above expression, we 
obtain the required form, called the canonical form or the disjunctive normal form of the 
given function, as follows 

(*' ny)u(xnynz) = (*' n y n l) u (x n y n z) 

= [(*' n y) n (z u z')] u(xfiy'nz) 

= (*' n y n z) u (*' n ?/ n «') u (* n y n z) 

See also Problem 6. 

It is easy to show that the canonical form of a Boolean function in three variables can 
contain at most 2 3 distinct terms. For, if x,y,z are the variables, a term is obtained by 
selecting x or x', y or y', z or z’ and forming their intersection. In general, the canonical 
form of a Boolean function in n variables can contain at most 2" distinct terms. The 
canonical form containing all of these 2“ terms is called the complete canonical form or 
complete disjunctive normal form in n variables. 

The complete canonical form in « variables is identically 1. This is shown for the 
case n = 3 in Problem 7, Page 231, while the general case can be proved by induction. It 
follows immediately that the complement F' of a Boolean function F expressed in canonical 
form is the union of all terms of the complete canonical form which do not appear in the 
canonical form of F. For example, if F = (x n y) U (*' D y) U (x' n y), then F'= (x D y ). 

In Problems 8 and 9, we prove 

Theorem I. If, in the complete canonical form in n variables, each variable is assigned 
arbitrarily the value 0 or 1, then just one term will have the value 1 while 
all other terms will have the value 0. 

and 

Theorem II. Two Boolean functions are equal if and only if their respective canonical 
forms are identical, i.e., consist of the same terms. 

The Boolean function in three variables of Example 4(6), when expressed as an inter¬ 
section of unions in which each union contains all of the variables, is 
(X u y) n (y u z) n ( x ' u z) n ( x ' u y) 

= [(x u y) u (* n Z')] n [(» u z) u (x it*')] n [(*' uz)u(»n y)] n [(*' u y) u (z n z')] 
= (xuizuz)n(iuyu Z') n (x'uyuz)n (*' uy'uz)n (*' uyu z’) 

This expression is called the dual canonical form or the conjunctive normal form of the 
function. Note that it is not the dual of the canonical form of that function. 
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The dual of each statement concerning the canonical form of a Boolean function is a 
valid statement concerning the dual canonical form of that function. (Note that the dual 
of term is factor.) The dual canonical form of a Boolean function in n variables can contain 
at most 2" distinct factors. The dual canonical form containing all of these factors is 
called the complete dual canonical form or the complete conjunctive normal form in n 
variables; its value is identically 0. The complement F' of a Boolean function F expressed 
in dual canonical form is the intersection of all factors of the complete dual canonical form 
which do not appear in the dual canonical form of F. Also, we have 

Theorem I'. If, in the complete dual canonical form in n variables, each variable is 
assigned arbitrarily the value 0 or 1, then just one factor will have the 
value 0 while all other factors will have the value 1. 
and 

Theorem II'. Two Boolean functions are equal if and only if their respective dual canonical 
forms are identical, i.e., consist of the same factors. 

In the next section we will use these theorems to determine the Boolean function when 
its values for all possible assignments of the values 0 and 1 to the variables are given. 


CHANGING THE FORM OF A BOOLEAN FUNCTION 

Denote by F(x,y,z) the Boolean function whose 
values for all possible assignments of 0 or 1 to the 
variables is given by Table 17-2. 

The proof of Theorem I suggests that the terms 
appearing in the canonical form of F(x,y,z) are 
precisely those of the complete canonical form in 
three variables which have the value 1 whenever 
F(x, y>z) = 1- For example, the first row of the table 
establishes x n y n z as a term while the third row 
yields x n y' n z as another. Thus, 

F(x,y,z) = (a n y n *) U {x n y' n z) U (x' n 
= (x n z) u (*' n y') 

Similarly, the factors appearing in the dual canonical form of F(x,y,z) are precisely 
those of the complete dual canonical form which have the value 0 whenever Fix, y, z) = 0. 
We have 

F(x,y,z) = (x' U y' U z) n (x U y' U z') n (x' U y U z) n (x U y' U z) 

= (*' u *) n (x U y') See Problem 10. 

If a Boolean function F is given in either the canonical or dual canonical form, the 
change to the other form may be readily made by using successively the two rules for 
finding the complement. The order in their use is immaterial; however, at times one order 
will require less computing than the other. 

Example 6: Find the dual canonical form for 

F = ix n y n z’) u (x n y' n z)u ix n y' n z’) u (x' n y n z) u (x' n y' n z') 

Here F’ = (as n y n z) u (»' n y' n *) u (*' n y n *') 


X 

V 

* 

F(x, y, z) 

1 

1 

1 

1 

1 

0 

0 

1 

0 

1 

1 

0 

1 

1 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

1 

0 

_2_J 

0 

1 


Table 17-2 


V n z) u (x' n y n z r ) 
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(the union of all terms of the complete canonical form not appearing in F) and 
F = (F'Y = (x U y U z') n (x U y' U z) n (x' U»'U z') (by Problem 4) 

See also Problem 11. 

The procedure for changing from canonical form to dual canonical form and vice versa 
may also be used advantageously in simplifying certain Boolean functions. 

Example 7: Simplify F = [(y n z') u (y' n z)] n [(*' n y) u (*' n z) u (x n y' n z')] 

Set Fj = (yn z’) U ( y ' n z) and F 2 = (*’ n y) u (x' n z) u (x n y’ n z'). 

Then F t = (x n y n z') u (»' n y n z') u(inj'nz) u ( x ' nj/'nz) 

F,' = (*' n y n z) u (*' n y' n z') u (x n y n z) u (x n y’ n z') 
and F t = (*u»'u*Tn(*u»uz)n (*' u*'u z') n (*' u y u z) 

Also F 2 = (x' n y n z) u (x' n y n z') u (x' n y' n z) u (x n y' n z') 

F 2 = (x' n y' n z') u (x n y n z) u (x n y' n z) u (x n y n z') 
and F 2 = (x U y U z) n (x' U y' U z') n (x' U y U z') n (x' Uj/'Uz) 

Then F = F,nF, 

- (x u y u z) n (x u y' u z') n (x' u y u z) n (x' u y u z') 
n (x' u y' u z) n (x' uy'u z') 

Now F' = (x U y' U z) n (x U y U z') 

and F = (x' n y n »') U (x' ny'nz) = x' n [(y n z') u (y' n z)] 


ORDER RELATION IN A BOOLEAN ALGEBRA 

Let U = {a, b, c] and S= { <p,A,B,C,D,E,F,U } where A = {a}, B = {6}, C = {c}, 
D = {a, 6}, E = {a, c), F = {6,c}. The relation c, defined in Chapter 1, when applied 
to S satisfies the following laws: 


For any X,Y,Z G S, 

(a) XQX 

(b) If Xc7 and Y C X, then X=Y. 

(c) If X C Y and Y Q Z, then XcZ. 

(d) If X c Y and X c Z, then X c (Y n Z). 

(e) If Xc Y, then X c {Y U Z). 

(/) X C y if and only if Y' C X'. 

(g ) IcY if and only if X U Y - Y or the equivalent X n Y' - 0. 

The first three laws assure (see Chapter 2) that C effects a partial ordering in S illus¬ 


trated by 



Fig. 17-1 
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We shall now define the relation C (read, “under”) in any Boolean algebra <B by 
a Qb if and only if a U b — b or the equivalent a n b' = 0 

for every a, b G 'B. (Note that this is merely a restatement of (g) in terms of the elements 

of < B.) There follows readily 
(a i) a c a 

(fti) If a c b and 6 C a, then a — b. 

(ci) If a C b and b c c, then a c c. 

so that C defines a partial order in C B. We leave for the reader to prove 
(di) If a c b and a c c, then a c (b Cl c). 

(ei) If a c b, then a C (6 U c) for any cG®, 

(ft) a C b if and only if b' c a'. 

In Problem 12, we prove 

Theorem III. For every a, b G ‘B, a U 6 is the least upper bound and a n ft is the greatest 
lower bound of a and ft. 

There follows easily 

Theorem IV. 0 Q b C 1, for every 6 € C B. 

ALGEBRA OF ELECTRICAL NETWORKS 

The algebra of electrical networks is an interesting and highly important example of 
the Boolean algebra (see Problem 15) of the two elements 0 and 1. The discussion here 
will be limited to the simplest kind of networks, that is, a network consisting only of 
switches. The simplest such network consists of a wire containing a single switch r: 


When the switch is closed so that current flows through the wire, we assign the value 1 
to r; when the switch is open so that no current flows through the wire, we assign the 
value 0 to r. Also, we shall assign the value 1 or 0 to any network according as current 
does or does not flow through it. In this simple case, the network has value 1 if and only 
if r has value 1 and the network has value 0 if and only if r has value 0. 

Consider now a network consisting of two switches r and s. [When connected in series: 


Fig. 17-2 


it is clear that the network has value 1 if and only if both r and s 
have value 1, while the network has value 0 in all other assign¬ 
ments of 0 or 1 to r and s. Hence this network can be repre¬ 
sented by the function F = F(r,s) which satisfies Table 17-S2 
We find easily F = rn«:j [When connected in parallel: 



r s F 

1 1 1 

10 0 
0 10 

0 0 0 

Table 17-1 & 


Fig. 17-3 
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it is clear that the network has value 1 if and only if at least 
one of r and s has value 1, and the network has value 0 if and 
only if both r and s have value 0. This network can be repre¬ 
sented by the function F = F(r,s) which satisfies Table 17-43 
We find easily F =rUs] For the various networks consisting 
of three switches, see Problem 13. 

Using more switches, various networks of a more complicated 
nature may be devised; for example, 


r s F 

_ _ - 

1 0 1 

0 1 1 

0 0 0 


Table 17-# % 


-c> 



Fig. 17-4 

The corresponding function for this network consists of the intersection of three factors: 

(rUs)nin(#Ui?Uw) 

So far all switches in a network have been tacitly assumed to act independently of 
one another. Two or more switches may, however, be connected so that (a) they open 
and close simultaneously or (6) the closing (opening) of one switch will open (close) all of 
the others. In case (o), we shall denote all switches by the same letter; in case (b), we 
shall denote some one of the switches by, say, r and all others by r’. In this case, any 
primed letter has value 0 when the unprimed letter has value 1 and vice versa. Thus the 
network 


— 1 

Fig.17-5 

consists of three pairs of switches: one pair, each of which is denoted by t, open and close 
simultaneously and two other pairs, denoted by r, r' and s, s', such that in each pair the 
closing of either switch opens the other. The corresponding function is basically a union 
of two terms each involving the three variables. For the upper wire we have r n s' n t 
and for the lower (t U s) n r'. Thus, the function corresponding to the network is 
F = (rns'ntju [(f U s) n r'] 

and the table giving the values (closure properties) of the function is 



Table 17-5 
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It is clear that current will flow through the network only in the following cases: 
(1) r and t are closed, s is open; (2) s and t are closed, r is open; (3) s is closed, r and t 
are open; (4) t is closed, r and s are open. 

In further analysis of series-parallel networks it will be helpful to know that the 
algebra of such networks is truly a Boolean algebra. In terms of a given network, the 
problem is this: Suppose F is the (switching) function associated with the network and 
suppose that by means of the laws of Boolean algebra this function is changed in form to 
G associated with a different network. Are the two networks interchangeable; in other 
words, will they have the same closure properties (table)? To settle the matter, first 
consider Tables 17-3 and 17-4 together with their associated networks and functions 
rfls and r U s respectively. In the course of forming these tables, we have verified that 
the postulates (i), (ii), (iv) for a Boolean algebra hold also for network algebra. For the 
case of postulate (iii), consider the networks 



Fig.17-6 

corresponding to the Boolean identity a U (6 n c) = (a U b) n (a U c). It is then clear 
from the table of closure properties 


a 

6 

c 

o u (6 n c) 

(a u b) n (a u e) 

i 

1 

1 

1 

1 

i 

1 

0 

1 

1 

i 

0 

1 

1 

1 

0 

1 

1 

1 

1 

1 

0 

0 

1 

1 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 


Table 17-6 


that the networks are interchangeable. 

We leave for the reader to consider the case of the Boolean identity 
o n (b U c) = (a n b) U (a n c) 
and conclude that network algebra is a Boolean algebra. 

SIMPLIFICATION OF NETWORKS 

Suppose now that the first three and last columns of Table 17-5 are given and we are 
asked to devise a network having the given closure properties. Using the rows in which 
F = 1, we obtain 

F = (rns'nt)u(r , njnt)u(r'nsnt')u (r' n s' n t) = (r' n s) u (s' n t) 

Since this is not the function (network) from 
which the table was originally computed, the 
network of Fig. 17-5 is unnecessarily complex 
and can be replaced by the simpler network, 
shown in Fig. 17-7. 




Fig. 17-7 
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Note. The fact that one of the switches of Fig. 17-7 is denoted by r' when there is no switch 
denoted by r has no significance here. If the reader has any doubt about this, let him 
interchange r and r' in Fig. 17-5 and obtain F = (r n s) U (s' n t) with diagram 



Fig. 17-8 


See Problem 14. 


Solved Problems 

1. Prove: U and n are associative, i.e., for every a,b,c E “B 

(a U b) U c = a U (6 U c) and (a n b) n c = a n (b n c) (4) 

(See 1.8-1.8', Chapter 1, Page 5.) 

Let * = (a n b) n c and y = a n (b n e). We are to prove x — y. Nov;, using (iii) and (3), 
oUli = a u [(a n b) n e] = [a u (a n 6)] n (a u e) 

- a n (a U e) = a = a u [a n (6 n e)] = aU y 

and o' u * = o' u [(a n b) n e] = [o' u (a n b)] n (o' u e) = [(o' u o) n (o' u 6)] n (o' u e) 

= [1 n (o' u 6)] n (o' ue) = (o' u b) n (o' u e) = o' u (6 n c) 

= (o' u o) n [o' u(bn c)] = o' u [o n (b n c)] = a' u y 

Hence (o U *) n (o' U *) = (a U y) n (o' U y) 

(a n o') u x = (an o') u y 

and x = y 

We leave for the reader to show that as a consequence parentheses may be inserted in 
a t U o 2 u • • ■ u o„ and u,no 2 n fle B at will. 

2. Prove: For each a€f, the element a' defined in (iv) is unique. 

Suppose the contrary, i.e., suppose for any o£? there are two elements o', o" G “B such that 
o U o' = 1 ana' = 0 

a u a" = 1 o n a" = 0 

Then a' = 1 n a' = (a U a") no' = (on o') U (a" n o') 

= (a n a") U (a" n o') = a" n (o U o') = a" n 1 = a" 

and o' is unique. 

3. Prove: For every a,b e “B 

(a U b)' = a' n b' and (a n 6)' = a' U b' (5) 

(See 1.11-1.11', Chapter 1, Page 5.) 

Since by Problem 2 there exists for every x G <B a unique x' such that x U x' = 1 and 
x n x' = 0, we need only verify that 

(a u b) U (o' n b') = [(a U 6) U o'] n [(a U b) U 6'] = [(a U o') Ub]n [a U (6 U 6')] 

= (1 u 6) n (a u 1) = 1 n 1 = 1 
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and (we leave it for the reader) (o U 6) n (o' n 6') = 0 

Using the results of Problem 2, it follows readily that 

(a, u o 2 u • • • u o B )' = a; n n • • • n a' 
an< * («t n «2 n - • • n a „)' = o' u o' u • • • u o' 


4. Prove: (a')' = a for every a G <B. (See 1.1, Chapter 1, Page 5.) 

(a')' = l n (o')' = (o u o') n (o')' = [o n (o')'] u [o' n (o')'] = [o n (o')'] u o 

= 0 u [o n (o')'] = (o n o') u [o n (o')'] = a n [o' u (o')'] = onl = o 

5. Simplify: [x U («' U y)’} D [x U (y' n *')']• 

[* u <*' u »)'] n [* u (»' n *')'] = [x u (* n y’)} n [x u (y u *)] = x n [* u (y u *)] = * 

6. Obtain the canonical form of [x U (s' U y)’] D [s U (y' n «')'], 

Using the identity of Problem 5, 

[* u (*' u y)'] n [* u (y' n z’Y] = * = * n (y u y') n (2 u z') 

= (* n y n z) u (x n y' n z) u (x n y n *') u (* n y’ n z') 


Prove: The complete canonical form in 3 variables is identically 1. 

First, we sho\v that the complete canonical form in 2 variables 

(* n v) u (* n y') u (»' n»)u (*' n y') = [(* n f )u(m y')} u [(*' n y) u (*' n y')] 
= [* n (y u y’)\ u [*' n (y u y')\ 

= (* u *') n (y u y') = l n l = i 

Inen the canonical form in 3 variables 

[(* n y n *) u (* n y n *')] u [(* n y' n z) u (* n y' n 2 ')] 

u [(*' n y n 2 ) u (*' n y n 2 ')] u [(*' n y' n 2 ) u (*' n y' n 2 ')] 

= [(* o y) u (x n y') u (*' n y) u (*' n y ')] n (2 u 2 ') = 1 n 1 = 1 


8. Prove: If, in the complete canonical form in n variables, each variable is assigned 
arbitrarily the value 0 or 1, then just one term will have the value 1 while all others 
will have the value 0. 

Let the values be assigned to the variables x u x 2 , .. The term whose value is 1 contains 

x i x i has the value 1 assigned or *' if x, has the value 0 assigned, x 2 if x 2 has the value 1 or 
x' if x 2 has the value 0, ..., x, if x„ has the value 1 or x' n if x„ has the value 0. Every other 
term of the complete canonical form will then have 0 as at least one factor and, hence, has 0 as value. 


9. Prove: Two Boolean functions are equal if and only if their respective canonical forms 
are identical, i.e. consist of the same terms. 

Clearly two functions are equal if their canonical forms consist of the same terms. Conversely 
if the two functions are equal, they must have the same value for each of the 2" possible assignments 
of 0 or 1 to the variables. Moreover, each of the 2" assignments for which the function has the 
value 1 determines a term of the canonical form of that function. Hence the two normal forms contain 
the same terms. 
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10. Find the Boolean function F defined by 


Table 17-7 

It is clear that the canonical form of F will consist of 5 terms while the dual canonical form 
will consist of 3 factors. We use the latter form. Then 

F - (*' u y' u z') n(iu»’uf)n(iuvu s') 

= (y' u *') n (x U y U z') = [y' n (as u »)] \J z' = (as n y') u z' 

11. Find the canonical form for F = (a: U y U z) n (x’ U/U z). 

Here F' = (as' n y' n z') u (as n y n *') 

(by the identity of Problem 3) and 

F = (F')' = (as n y n *) u (as n y' n z) u (» n y' n *') u (as' n y n *) u (as' n y' n ») u (as' n y n *') 
(the union of all terms of the complete canonical form not appearing in F'). 


12. Prove: For every a, b € C B, a U b is the least upper bound and an 6 is the greatest 
lower bound of a and b. 

That a U b is an upper bound of a and b follows from 

au(oub) = aub = 6 U (a U 6) 

Let c be any other upper bound of a and b. Then a Q c and b Q e sc 
6 U c = c. Now (a u 6) U c = au(6u<s) = aUe = e 

Thus, (a U b) C c and a U b is the least upper bound as required. 

Similarly, a n 6 is a lower bound of a and 6 since 

(a n b) U a — a and (a n b) u b = b 
Let c be any other lower bound of a and b. Then e Q a and cciso that e U 
Now c u (o n b) = (cue)n(sul) = a n 6 

Thus, c c (o n 6) and a n b is the greatest lower bound as required. 


s = e and 


. and c U 6 = b. 


13. Discuss the possible networks consisting of three switches r,s,t. 

There are four cases: 

(i) The switches are connected in series. The diagram is 


and the function is 


n»ni. 
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(ii) The switches are connected in parallel. The diagram is 



and the function is rUiUt. 

(iii) The series-parallel combination 



with function r n (« U t). 

(iv) The series-parallel combination 



with function r U (« n t). 



by a simpler one. 

The Boolean function for the given network ii 


The simpler network i; 


F = (r n t) u {* n («' u t) n [r' u (« n t')]} 
= (r n t) u {* n [(«' u()n (r' u t')]} 

= (rn t) u [r* n (« n *)] = (ru»)nt 



Fig. 17-9(6) 
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Supplementary Problems 

U I 0 1 

15. Show that the set {0,1} together with the operations as defined in 0 0 1 and 

is a Boolean algebra. 1 I 1 1 

16. Show that the set {a,b,e,d} together with the operations defined in 




is a Boolean algebra. 

17. Show that the Boolean algebra of Problem 16 is isomorphic to the algebra of all subsets of a set of 
two elements. 

18. Why is there no Boolean algebra having just three distinct elements? 

19. Let S be a subset of N, and for any a, b S S define a U b and a n b to be respectively the least 
common multiple and greatest common divisor of a and 6. Show 

(a) 9 is a Boolean algebra when S = {1, 2, 3, 6, 7,14, 21,42}. 

( b) ‘B is not a Boolean algebra when S = {1,2, 3,4,6,8,12,24}. 

20. Show that a U (a n b) = a n (o U b) without using Example 3, Page 223. State the dual of the 
identity and prove it. 

21. Prove: For every o,Je8, a U (o' n b) = a U 6. State the dual and prove it. 

22. Obtain the identities of Example 1, Page 222, by taking 6 = a in the identities of Problem 21. 

23. Obtain as in Problem 22 the identities of Example 2, Page 222. 

24. Prove: 0' = 1 and 1' = 0. (See 1.2-1.2', Chapter 1, Page 5.) 

Hint. Take a = 0 and 6 = 1 in the identity of Problem 21. 

25. Prove: (a n 6') U (6 n a') = (a U b) n (a' U b'). Write the dual. 

26. Prove: (o U 5) n (6 U e) n (e U o) = (a n 6) U (6 n c) U (c n a). What is its dual? 

27. Prove: If a U x = b U x and a U x' — b U x', then a = b. 

Hint. Consider (a U a) n (a U x') = (b U x) n (6 U x'). 

28. State the dual of Problem 27 and prove it. 

29. Prove: If a n 6 = a n c and a u b = a U c for any a,b,cG C B, then b = c. 

30. Simplify: (a) (oUi)n»'fl b' (e) [(*' n {/')' U z] n (x U y')' 

(b) (a n 6 n c) u a' u b' u c' (/) (a u 6') n (a' u b) n (a' u b') 

(e) (a n b) u [e n (o' u b')] (g) [(a U 6) n (e u 6')] u [6 n (o' u e')] 

(d) [a u (o' n 6)] n [6 u (b n c)] 

Ana. (a) 0, (b) 1, (c) (a n b) U c, (d) b, (e) x' n y, (/) a' n b', (g) a U b 

31. Prove: (oUb)n (o' U c) = (o' fl i) U (a n c) U (b n e) 

= (a u b) n (a' u c) n (b u c) = (a n e) u (o' n b) 

32. Find, by inspection, the complement of each of the following in two ways: 

(a) (* n y) u (* n y') (C) (x u y’ u z) n (* u y u z') n (x u y'u z') 

(b) (*n»'nz)u(z'ri!/n z') (d) (x u y' u z) n (*' u y u z) 

33. Express each of the following in both canonical form and dual canonical form in three variables: 

(a) *' u y', (b) (a; n y') u (x' n y), (c) (x u y) n (x' U z'), ( d ) x n z, (e) x n (y' u z) 
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Partial Ans. 

(а) (x n y' n z) u (* n y' n z') u (*' n y n z) u (*' n y > n z) u (*' n y n z') u (x' n y' n z') 

(б) (lUj(Uz)n(iuju z') n (*' u y' u z) n ( x' u y’ u z') 

(c) (* n y n z') u (* n y' n z') u (*' n y n z) u (x' n y n z') 

(d) (iujuz)n(*uj'uz)n(*uvu z') nfiuj'u z') n(j'ujuz)n (x' u y' u z) 

<«) (* u } u z) n (* u v 1 u z) n (* u y u z') n (* u y' u z') n (*' u j/'.u z) 

34. Express each of the following in both canonical and dual canonical form in the minimum number of 

variables: 

(а) * U (x' n y) (d) (xnynz) U [(* U y) n (* U z)] 

(б) [xn(y U z)] U [* n (y u z')] (e) (*Uj)n(zU z') n (x' U y') n (*' u z) 

(c) (zuyuz)n[(znv)u (*' n z)] (/) (m»)u(zn z') u (*' n z) 

Partial Ans. 

(а) (x n y) U (a; n y') u (»' n y) (d) (x U y U z) n (* U y U z') D (x U y' U z) 

(б) (>iUi/)n(zu y') (e) (* n y' n z) u (»' n y n z') 

(c) (as n y n z) u (* n y n z') u (*' n y n z) u (*' n n z) (/) (*ujuz)n(*'ui/uz')n(zui/'u z) 

35. Write the term of the complete canonical form in x, y, z having the value 1 when: 

(o) x = z = 0, y = 1 ; (6) * = y = 1 , z = 0; (c) * = 0, y = z = 1 . 

Ans. (a) x' n y n z', (b) x n y n z' 

36. Write the term of the complete canonical form in x,y,z,w having the value 1 when: 

(a) x = y = \, z = w = 0- (b) x - y - w = 0, z = 1; (c) * = 0, y = z = w = 1. 

Ans. (a) x n y n z' n w\ (c) x' n y n z n w 

37. Write the factor of the complete dual canonical form in x,y,z having the value 0 when: 

(a) x = z = 0, y = 1; (6) x = y = l, z — 0; (c) x = 0, y = z = l. 

Ans. (a) x U y' u z, (6) *' u y’ U z 

38. Write the factor of the complete dual canonical form in x,y,z,w having the value 0 when: 

(a) x = y = l, z = w = 0; (b) x = y = w = 0, z = 1; (e) x = 0, y = z = w = 1. 

Ans. (o) x' U y’ U z U w, (c) x U y’ U z' U w' 

39. Write the function in three variables whose value is 1 

(а) if and only if two of the variables are 1 and the other is 0, 

(б) if and only if more than one variable is 1. 

Ans. (a) (x n y n z') u (ar n y’ n z) u (*' n y n z), (6) [x n (y u z)] u (y n z) 

40. Write the function in three variables whose value is 0 

(a) if and only if two of the variables are 0 and the other is 1, 

( b) if and only if more than one variable is 0. 

Ans. The duals in Problem 39. 

41. Obtain in simplest form the Boolean functions F t , F 2 , .. .,F S defined as follows: 



Ans. F t = (x U y') n z', F z - x' U ( y' n z) U (y n z'), F s = (x U z) n [y' U (x n z)], F 7 = y' 





236 


BOOLEAN ALGEBRAS 


[CHAP. 17 


42. Show that F 1 and F s of Problem 41 can be found by inspection. 

43. Prove: (dj) If o Q b and a c e, then a C (b n e). 

(e t ) If a Q b then a C (6 U c) for any c€®. 

(/,) a C 6 if and only if b' c o'. 

44. Prove: If a, 6 e ® such that a Q b then, for any c£®, a U (6 n e) = b n (o u c). 

45. Prove: For every he®, 0 C 6 C 1. 

46. Construct a diagram similar to Fig. 17-1 for the Boolean algebra of all subsets of B = {a,b,e,d}. 

47. Diagram the networks represented by a U (o' n 6) and o u b and show by tables that they have 
the same closure properties. 

48. Diagram the networks (i) (oul)no'(l 6' and (ii) (o n 6 n e) U (o' U 6' u c'). Construct tables 
of closure properties for each. What do you conclude? 

49. Diagram each of the following networks 

(i) (o u b’) n (o' u 6) n (o' u b') (Hi) [(« u 6) n (c u 6')] u [b n (o' u o')] 

(ii) (o n b) u [c n (o' u 6')] (iv) (o n b n e) u o' u b' u c' 

Using the results obtained in Problem 30, diagram the simpler network for each. 

Partial Ans. 

- -CD-CD-CZh ■- 

■ -C5> 

50. Diagram each of the networks (r U *') n (r' U «) and (r n «) U (r' fl s') and show that they have 
the same closure properties. 

51. Diagram the simplest network having the closure properties of each of F X -F 8 in Problem 41. 
Partial Ans. 



-CD- ■ - -OCD- 

plify: i—• *' •—i 

.CCCE^ 

-EE3- 


To afford practice and also to check the results, it is suggested that (iii) and (iv) be solved by forming 
the table of closure properties and also by the procedure of Example 7, Page 226. 

Partial Ans. (i) and (ii) 
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56. From his garage M may enter either of two rooms. Obtain the network which will permit M to turn 
on or off the light in the garage either from the garage or one of the rooms irrespective of the position 
of the other two switches. 



57. Design a network which will permit independent control of a light from any one of four switches. 
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Highest common factor (see Greatest common 
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algebra, 219 
combination, 49, 145 
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Mathematical induction, 31, 37 
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identity, 166 
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non-singular, 172 
orthogonal, 207 
over f, 166 
product, 165 
rank, 172 
row rank, 172 
scalar product, 164 
singular, 172 
sum, 164 
symmetric, 206 
triangular, 171 
zero, 166 

Matrix polynomial, 198 
Maximal ideal, 106 
Minimum polynomial, 133, 177 
Modulus of a complex number, 77 
Monic polynomial, 126 
Multiples, 33, 43 
Multiplication, 
of complex numbers, 75 
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Multiplication (cont.) 
of integers, 39 

of linear transformations, 151 
of matrices, 165 
of matrix polynomials, 200 
of natural numbers, 31 
of polynomials, 125 
of rational numbers, 60 
of real numbers, 67 
Multiplicative inverse, 60, 68, 75 
Multiplicity of root, 129 

Natural numbers, 30 
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matrix, 172 
transformation, 151 
Norm, 119 
Normal divisor, 87 
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conjunctive, disjunctive, 224 
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Normal subgroup, 87 
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Numbers, 
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irrational, 69 
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prime, 49 
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real, 65 
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Ordered integral domain, 116 
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matrix, 207 
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vectors, 149 
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even, 24, 27 
group, 84 
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Perpendicular vectors, 149 
Polar form, 76 
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prime, 128 
ring of, 125 
roots of, 127 
zero of a, 127 
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Positive integers (see also Natural numbers), 39 
Powers (see also Exponents), 33, 43 
Prime, 49 
factors, 52 
field, 118 
ideal, 106 
integer, 49 
polynomial, 128 
relatively prime integer, 52 
Primitive roots of unity, 78 
Principal ideal, 105 
ring, 106 
Product, 
dot, 149 
inner, 149 
of cosets, 88 

of linear transformations, 151 
of mappings, 8 
of matrices, 165 
of matrix polynomials, 200 
of polynomials, 125 
of subgroups, 89 
scalar, 149, 151, 164 
set, 5 

ideal, 104 

orthogonal transformation, 208 
subgroup, 83 
subring, 102 
subset, 2 

Quadric surfaces, 208 
Quartic polynomial, 140 
Quaternion group, 100, 111, 196 
Quotient, 
group, 88 
ring, 106 

Range of a mapping, 6 
Rank, 

of a linear transformation, 151 
of a matrix, 172, 199 
Rational number, 60 
Real number, 65 

Real part (of a complex number), 76 
Real symmetric matrix, 206 
Reflexive relation, 15 
Regular permutation group, 93 
Relation, 15 
equivalence, 16 
Relatively prime integers, 52 
Remainder theorem, 128 
Residue classes, 53 
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Right, 
coset, 86 
ideal, 104 
Ring, 101 
Boolean, 112 
commutative, 103 
division, 117 
Euclidean, 107 
principal ideal, 106 
quotient, 106 
Roots, 

characteristic, 204 
latent, 204 

of cubic polynomials, 139 
of polynomials, 127 
of quartic polynomials, 140 
of unity, 78 
Row, 

equivalent, 170 
rank, 172, 199 
transformation, 170, 198 
vector, 164 

Scalar, 143 
multiplication, 143 
product, 149, 164 
Schwarz inequality, 149 
Set, 1 

denumerable, 8 
finite, 8 
infinite, 8 

Similar matrices, 205 
Simple, 
group, 88 
ring, 104 
zero, 129 

Simultaneous linear equations, 178 
Singular, 
matrix, 172 
transformation, 151 
Skew field, 117 
Span, 145 

Square, octic group of a, 92 
Subalgebra, 167 
Subdomain, 115 
Subfield, 118 
Subgroup, 83 
invariant, 87 
normal, 87 
proper, 83 
Subring, 102 
invariant, 104 
proper, 102 
Subset, 2 
Subspace, 144 
Subtraction, 41, 61, 69, 76 
Successor, 30 
Sum (see Addition) 

Symmetric, 
group, 84 


Symmetric (cont.) 
matrix, 206 
relation, 16 

Systems of linear equations, 178 
homogeneous, 181 
non-homogeneous, 179 

Total matrix algebra, 167 
Transformation, 
group of, 152 
linear, 149 
singular, 151 
orthogonal, 207 
Transitive relation, 16 
Transpose, 183 
Transposition, 24 
Triangle inequality, 149 
Triangular matrix, 171 
Trichotomy law, 32, 40, 61, 68 
Trigonometric representation of complex 
numbers, 76, 77 
Two-sided ideal, 104 

Union, 3 

Unique factorization theorem, 52, 117, 132 
Uniqueness, 
of identity, 20 
of inverses, 20 
Unit, 115 
Unity element, 19 
Universal set, 2 
Upper bound, 70 

Value, absolute, 42 
of complex numbers, 77 
Vector(s), 143 
characteristic, 203 
column, 164 
invariant, 203 
length of, 143, 149 
orthogonal, 149 
row, 164 
unit, 147 
Vector space, 144 
basis of a, 147, 207 
Venn diagram, 3 

Well-defined, 
operation, 20 
set, 1 

Well-ordered set, 18 

Zero, 40, 60 
divisors of, 103 
matrix, 166 

of a cubic polynomial, 139 
of a polynomial, 127 
of a quartic polynomial, 140 
polynomial, 125 
vector, 144 
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A 

Set, 1 

N(«) 

Norm, 119 


Matrix, 164 

0 

Zero matrix, 166 

A' 

Complement of set A, 3 

Q 

Set of all rational numbers, 1, 60 

A T 

Transpose of matrix A, 183 




Q + 

Set of all positive rational numbers, 61 

\A\ 

Determinant of matrix A, 182 

Q~ 

Set of all negative rational numbers, 61, 67 

M 

Equivalence class, 16 

R 

Set of all real numbers, 1, 65 

W 

Absolute value of a, 42 

% 

King, 101 

Kj] 

Matrix, 165 


Set of all polynomials in x 

a|6 

a is a divisor of 6, 49 


with coefficients in 125 

(a,b) 

Greatest common divisor, 50 

s„ 

Set of all permutations on n symbols, 22, 84 


Boolean algebra, 222 

of 

Division ring, 117 



T 

Linear transformation, 149 

C 

Set of all complex numbers, 75 



Dedekind cut, 66 

u 

Identity element of a group, 82 

Unity element of a ring with unity, 103 

3) 

Integral domain, 114 

V,V n (T) 

Vector space, 144 

Eli 

Basis matrix for eM n (f), 167 

g 

Complex number, 75 

T 

Field, 118 


Zero element of a ring, 102 

n*\ 

Set of all polynomials in x 
with coefficients in f, 127 

2 

Conjugate of complex number z, 75 



Mappings, 6 

Q 

Group, 82 

<*(*), P(x) 

Polynomials, 124 

g.l.b. 

Greatest lower bound, 70 

€ 

Is in, belongs to, 1 

1 

Set of all integers, 1, 38 


Unit vectors, 147 

/+ 

Set of all positive integers, 39 


Vectors, 143 

I- 

Set of all negative integers, 40 

r 

Zero vector, 143 

I/(m) 

Integers modulo in, 63 

{«, 6, e} 

Set, 1 

i 

Complex unit V—T, 76 

c.C 

Is contained in, 2 

s 

Ideal, 104 

0 

The empty set, 2 

JL 

Linear algebra, 219 

n 

Intersection, 3 



u 

Union, 3 

l.u.b. 
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